TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images

Research output: Contribution to journalArticlepeer-review

94 Scopus citations

Abstract

Weakly supervised object detection (WSOD) in remote sensing images (RSI) plays an essential role in RSI understanding applications. Currently, predominant works are inclined to first activate the most discriminative region and then pursue the whole object by analyzing the context information of the activated region. However, the most discriminative region usually only covers a small crucial part. Besides, many same-class instances often appear in adjacent locations. In such a case, treating proposals of large spatial overlap as the same-class instances not only introduces potential ambiguities but also misleads the detection model to recognize multiple adjacent instances as one object instance. To address these challenges, a novel triple context-aware network (TCANet) is proposed to learn complementary and discriminative visual patterns for WSOD in RSIs. Specifically, a global context-aware enhancement (GCAE) module is first designed to activate the features of the whole object by capturing the global visual scene context. Then, a dual-local context residual (DLCR) module is further developed to capture the instance-level discriminative cues by leveraging the semantic discrepancy of the local context. Furthermore, an effective adaptive-weighted refinement loss is integrated into the DLCR module to reduce the ambiguities in the label propagating process. The collaboration of GCAE and DLCR formulates a unique TCANet that can be learned in an end-to-end manner. Comprehensive experiments are carried out on the challenging NWPU VHR-10.v2 and DIOR data sets. We achieve a 58.8% mAP and a 25.8% mAP on the NWPU VHR-10.v2 and DIOR data sets, respectively, which both significantly outperform the state of the arts.

Original languageEnglish
Article number9239339
Pages (from-to)6946-6955
Number of pages10
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume59
Issue number8
DOIs
StatePublished - Aug 2021

Keywords

  • Context-aware network
  • remote sensing images (RSIs)
  • weakly supervised object detection (WSOD)

Fingerprint

Dive into the research topics of 'TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images'. Together they form a unique fingerprint.

Cite this