TY - JOUR
T1 - Multi-Scale Oriented Object Detection With Focus Error Ellipse Loss
AU - Gao, Pengfei
AU - Lu, Xuanbei
AU - Li, Ke
AU - Cheng, Gong
AU - You, Xiong
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - The loss function and feature extraction framework are essential parts of the algorithm design and significantly affect the accuracy of oriented object detection in remote sensing images. Though considerable progress has been made, there are still challenges left to be explored, e.g., large variations in scales, arbitrary direction, and dense distribution of the objects, which may have some undesirable effects, such as inaccurate object position regression, high false alarm, and miss rate. To address the above problems, we propose a focus error ellipse (FEE) loss function. This function bolsters the detection accuracy by narrowing the distance between the center points of the labeled and predicted bounding boxes based on the error ellipse. For the network part, we carefully crafted two unit modules: a fine-grained and context-augmented module (FCM) and a semantic information regrouping module (SIRM). The FCM aligns fine-grained information with contextual information to establish dependencies between local and global features, which helps to grasp more holistic characteristics of objects. The SIRM reorganizes the acquired deep semantic features in the channel dimension, enhances the weight of task-beneficial semantic information, and further derives the optimal combination method of feature subsets for object detection. Based on the aforementioned work, we developed an oriented object detection framework, which further improves the detection accuracy of large aspect ratio objects and dense scenes. The experimental results show that the proposed method can produce competitive performance in oriented object detection compared to other state-of-the-art models.
AB - The loss function and feature extraction framework are essential parts of the algorithm design and significantly affect the accuracy of oriented object detection in remote sensing images. Though considerable progress has been made, there are still challenges left to be explored, e.g., large variations in scales, arbitrary direction, and dense distribution of the objects, which may have some undesirable effects, such as inaccurate object position regression, high false alarm, and miss rate. To address the above problems, we propose a focus error ellipse (FEE) loss function. This function bolsters the detection accuracy by narrowing the distance between the center points of the labeled and predicted bounding boxes based on the error ellipse. For the network part, we carefully crafted two unit modules: a fine-grained and context-augmented module (FCM) and a semantic information regrouping module (SIRM). The FCM aligns fine-grained information with contextual information to establish dependencies between local and global features, which helps to grasp more holistic characteristics of objects. The SIRM reorganizes the acquired deep semantic features in the channel dimension, enhances the weight of task-beneficial semantic information, and further derives the optimal combination method of feature subsets for object detection. Based on the aforementioned work, we developed an oriented object detection framework, which further improves the detection accuracy of large aspect ratio objects and dense scenes. The experimental results show that the proposed method can produce competitive performance in oriented object detection compared to other state-of-the-art models.
KW - Fine-grained and context-augmented module (FCM)
KW - focus error ellipse (FEE)
KW - multi-scale objects
KW - oriented object detection
KW - remote sensing
UR - https://www.scopus.com/pages/publications/105018217662
U2 - 10.1109/TGRS.2025.3618618
DO - 10.1109/TGRS.2025.3618618
M3 - 文章
AN - SCOPUS:105018217662
SN - 0196-2892
VL - 63
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5528813
ER -