Learning Discriminative Representation for Fine-Grained Object Detection in Remote Sensing Images

  • Northwestern Polytechnical University Xian

Research output: Contribution to journal (article, peer-reviewed)

38 Scopus citations

Abstract

Fine-grained object detection (FGOD) in remote sensing images is an emerging and challenging task in the field of intelligent image interpretation. It aims to localize objects while classifying them into different fine-grained categories. Modern FGOD methods are mainly derived from well-developed detectors and have made compelling progress. Despite this, these methods struggle to classify objects at the subordinate level because of limitations in how they represent objects. In this paper, we propose a network capable of learning discriminative representation (DR) for fine-grained object detection in remote sensing images, named DRNet. First, a fine-grained branch that works in parallel with other task branches is introduced, where objects’ features are re-encoded with dual refinement to generate a discriminative representation, enabling accurate fine-grained classification. Second, to train the fine-grained branch, we design a confusion-minimized loss that automatically scales loss contributions according to the separability of samples, further boosting the discriminative ability of the representation and better addressing hard-to-distinguish objects. Moreover, we devise an interaction verification strategy that empowers the network to fully utilize the results of fine-grained classification and coarse classification to achieve robust inference. On the large-scale FAIR1M-1.0 and FAIR1M-2.0 datasets, our DRNet with ResNet50 and a 1× training schedule obtains 40.87% mAP and 47.04% mAP, respectively, establishing a new state of the art for fine-grained object detection in remote sensing images.
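
The abstract does not state the formula of the confusion-minimized loss, so the following is only an illustrative sketch of the general idea of scaling a sample's loss contribution by how separable it is. The weighting rule (based on the probability margin between the true class and its strongest rival), the function names, and the `gamma` parameter are all assumptions made for illustration and should not be read as DRNet's actual loss.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def separability_weighted_loss(logits, target, gamma=2.0):
    """Cross-entropy scaled by a hypothetical separability weight.

    Assumption (not from the paper): separability is measured as the
    margin between the true-class probability and the highest
    probability among the other classes. Well-separated samples
    (large margin) are down-weighted; confusable ones keep more weight.
    Requires at least two classes.
    """
    probs = softmax(logits)
    p_true = probs[target]
    p_rival = max(p for i, p in enumerate(probs) if i != target)
    margin = p_true - p_rival                  # lies in [-1, 1]
    weight = ((1.0 - margin) / 2.0) ** gamma   # lies in [0, 1]
    return -weight * math.log(p_true)


# An easily separated sample versus one confusable with a sibling class.
easy = separability_weighted_loss([5.0, 0.0, 0.0], target=0)
hard = separability_weighted_loss([1.0, 0.9, 0.0], target=0)
print(easy < hard)
```

Under this assumed weighting, the confusable sample contributes far more to the total loss than the easy one, which mirrors the stated goal of focusing training on hard-to-distinguish objects.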

Original language: English
Pages (from-to): 8197-8208
Number of pages: 12
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 35
Issue number: 8
State: Published - 2025

Keywords

  • Fine-grained object detection
  • confusion-minimized loss
  • discriminative representation learning
  • fine-grained branch
