Incorporating Multiscale Context and Task-Consistent Focal Loss into Oriented Object Detection

  • Zhengzhou University of Light Industry

Research output: Contribution to journal, Article, peer-reviewed

23 Citations (Scopus)

Abstract

Oriented object detection (OOD) in remote sensing images (RSIs) aims to precisely localize and identify objects with arbitrary orientations. Two-stage OOD methods have attracted considerable interest because of their superior accuracy, but they still face two major problems. First, misclassification occurs frequently because most classification strategies rely solely on the features of the proposals themselves. Second, most loss functions cannot simultaneously concentrate on hard samples and boost the consistency between identification and localization, which limits further improvement of OOD models. To address the first problem, multiscale contextual information is incorporated into a two-stage OOD model in this article. Specifically, N contextual branches are added to predict the class confidence score (CCS) of each proposal and of its N enlarged proposals, which include multiscale context, and the final CCS of each proposal is the mean of these N + 1 CCSs. To tackle the second problem, a task-consistent focal (TF) loss is proposed. The TF loss uses the difficulty of localization as the weight of the classification loss and the difficulty of identification as the weight of the regression loss. Minimizing the TF loss therefore concentrates training on hard samples and optimizes classification and regression synchronously. Ablation studies validate the contextual information, the TF loss, and their combination. Comparison with popular OOD models demonstrates the superior performance of our model on the DOTA and DIOR-R datasets.
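The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so the box format (cx, cy, w, h, angle), the enlargement by a scale factor, the difficulty measures (1 - IoU and 1 - p), the exponent gamma, and all function names are hypothetical choices made for illustration. Only the averaging of the N + 1 CCSs and the cross-task direction of the weights come from the text.

```python
import math


def enlarge_proposal(box, scale):
    """Scale an oriented box (cx, cy, w, h, angle) about its center.

    Assumption: context is gathered by enlarging width and height;
    the abstract does not specify how the enlarged proposals are built.
    """
    cx, cy, w, h, angle = box
    return (cx, cy, w * scale, h * scale, angle)


def fuse_ccs(scores_per_branch):
    """Average per-class scores over the N + 1 branches.

    scores_per_branch holds one score list for the original proposal
    and one for each of its N enlarged proposals.
    """
    n_branches = len(scores_per_branch)
    n_classes = len(scores_per_branch[0])
    return [
        sum(branch[c] for branch in scores_per_branch) / n_branches
        for c in range(n_classes)
    ]


def cross_weighted_losses(p_true, iou, gamma=2.0):
    """Hypothetical cross-task weighting, NOT the paper's TF loss formula.

    p_true: predicted probability of the ground-truth class.
    iou:    overlap between the predicted and ground-truth boxes.
    """
    cls_difficulty = 1.0 - p_true   # assumed measure of identification difficulty
    loc_difficulty = 1.0 - iou      # assumed measure of localization difficulty
    cls_loss = -math.log(max(p_true, 1e-12))  # cross-entropy on the true class
    reg_loss = 1.0 - iou                      # simple IoU-style regression loss
    # Localization difficulty weights the classification loss, and
    # identification difficulty weights the regression loss.
    weighted_cls = (loc_difficulty ** gamma) * cls_loss
    weighted_reg = (cls_difficulty ** gamma) * reg_loss
    return weighted_cls, weighted_reg


if __name__ == "__main__":
    proposal = (50.0, 40.0, 20.0, 8.0, 0.3)
    context_boxes = [enlarge_proposal(proposal, s) for s in (1.5, 2.0)]  # N = 2
    print(context_boxes)
    print(fuse_ccs([[0.7, 0.2, 0.1], [0.5, 0.3, 0.2], [0.6, 0.1, 0.3]]))
    print(cross_weighted_losses(p_true=0.9, iou=0.4))
```

Under these assumptions, a proposal that is classified confidently but localized poorly still receives a large classification weight, which is one way to couple the two tasks as the abstract describes.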

Original language: English
Article number: 5628411
Journal: IEEE Transactions on Geoscience and Remote Sensing
Volume: 63
DOI
Publication status: Published - 2025
