Rotation-Invariant Latent Semantic Representation Learning for Object Detection in VHR Optical Remote Sensing Images

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Object detection in very high resolution (VHR) optical remote sensing images is a fundamental yet challenging problem for the field of remote sensing image analysis. The detection performance is heavily dependent on the representation capability of the extracted features. Recently, convolutional neural networks (CNNs) have made a breakthrough for various applications in nature images. However, it is problematic to directly apply CNN to perform object detection in VHR optical remote sensing images due to the problem of object rotation variations. To address this issue, a novel rotation invariant probabilistic Latent Semantic Analysis (RI-pLSA) model is proposed to learn latent semantic representations for object detection. This is achieved by imposing a rotation-invariant regularization term on the objective function of pLSA to enforce the learned representation from all rotations of the same sample to be as consistent as possible. Additionally, the proposed RI-pLSA model takes the CNN features as input, which generates more powerful semantic representation for object detection. Comprehensive experiments on a publicly available ten-class object detection dataset demonstrate the superiority and effectiveness of our method compared with state-of-the-arts.

Original languageEnglish
Title of host publication2019 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1382-1385
Number of pages4
ISBN (Electronic)9781538691540
DOIs
StatePublished - Jul 2019
Event39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019 - Yokohama, Japan
Duration: 28 Jul 20192 Aug 2019

Publication series

NameInternational Geoscience and Remote Sensing Symposium (IGARSS)

Conference

Conference39th IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2019
Country/TerritoryJapan
CityYokohama
Period28/07/192/08/19

Keywords

  • convolutional neural networks (CNNs)
  • Object detection
  • remote sensing images
  • rotation invariant probabilistic Latent Semantic Analysis (pLSA)

Fingerprint

Dive into the research topics of 'Rotation-Invariant Latent Semantic Representation Learning for Object Detection in VHR Optical Remote Sensing Images'. Together they form a unique fingerprint.

Cite this