TY - GEN
T1 - Rotation-Invariant Latent Semantic Representation Learning for Object Detection in VHR Optical Remote Sensing Images
AU - Yao, Xiwen
AU - Feng, Xiaoxu
AU - Cheng, Gong
AU - Han, Junwei
AU - Guo, Lei
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/7
Y1 - 2019/7
AB - Object detection in very high resolution (VHR) optical remote sensing images is a fundamental yet challenging problem in remote sensing image analysis. Detection performance depends heavily on the representation capability of the extracted features. Recently, convolutional neural networks (CNNs) have achieved breakthroughs in various applications on natural images. However, directly applying CNNs to object detection in VHR optical remote sensing images is problematic because of object rotation variations. To address this issue, a novel rotation-invariant probabilistic latent semantic analysis (RI-pLSA) model is proposed to learn latent semantic representations for object detection. This is achieved by imposing a rotation-invariant regularization term on the objective function of pLSA, which enforces the representations learned from all rotations of the same sample to be as consistent as possible. In addition, the proposed RI-pLSA model takes CNN features as input, which yields a more powerful semantic representation for object detection. Comprehensive experiments on a publicly available ten-class object detection dataset demonstrate the superiority and effectiveness of the proposed method compared with state-of-the-art methods.
KW - convolutional neural networks (CNNs)
KW - Object detection
KW - remote sensing images
KW - rotation-invariant probabilistic latent semantic analysis (pLSA)
UR - http://www.scopus.com/inward/record.url?scp=85077699148&partnerID=8YFLogxK
U2 - 10.1109/IGARSS.2019.8899285
DO - 10.1109/IGARSS.2019.8899285
M3 - Conference contribution
AN - SCOPUS:85077699148
T3 - International Geoscience and Remote Sensing Symposium (IGARSS)
SP - 1382
EP - 1385
BT - 2019 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019
Y2 - 28 July 2019 through 2 August 2019
ER -