TY - JOUR
T1 - Global to Local
T2 - A Scale-Aware Network for Remote Sensing Object Detection
AU - Gao, Tao
AU - Niu, Qianqian
AU - Zhang, Jing
AU - Chen, Ting
AU - Mei, Shaohui
AU - Jubair, Ahmad
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - With the wide application of remote sensing images (RSIs) in military and civil fields, remote sensing object detection (RSOD) has gradually become a hot research direction. However, we observe two main challenges for RSOD, namely, the complicated background and the small objects issues. Given the different appearances of generic objects and remote sensing objects, the detection algorithms designed for the former usually cannot perform well for the latter. We propose a novel global-to-local scale-aware detection network (GLSANet) for RSOD, aiming to solve the abovementioned two challenges. First, we design a global semantic information interaction module (GSIIM) to excavate and reinforce the high-level semantic information in the deep feature map, which alleviates the obstacles of complex background on foreground objects. Second, we optimize the feature pyramid network to improve the performance of multiscale object detection in RSIs. Finally, a local attention pyramid (LAP) is introduced to highlight the feature representation of small objects gradually while suppressing the background and noise in the shallower feature maps. Extensive experiments on three public datasets demonstrate that the proposed method achieves superior performance compared with the state-of-the-art detectors, especially on small object detection datasets. Specifically, our algorithm reaches 94.57% mean average precision (mAP) on the NWPU VHR-10 dataset, 95.93% mAP on the RSOD dataset, and 77.9% mAP on the DIOR dataset.
AB - With the wide application of remote sensing images (RSIs) in military and civil fields, remote sensing object detection (RSOD) has gradually become a hot research direction. However, we observe two main challenges for RSOD, namely, the complicated background and the small objects issues. Given the different appearances of generic objects and remote sensing objects, the detection algorithms designed for the former usually cannot perform well for the latter. We propose a novel global-to-local scale-aware detection network (GLSANet) for RSOD, aiming to solve the abovementioned two challenges. First, we design a global semantic information interaction module (GSIIM) to excavate and reinforce the high-level semantic information in the deep feature map, which alleviates the obstacles of complex background on foreground objects. Second, we optimize the feature pyramid network to improve the performance of multiscale object detection in RSIs. Finally, a local attention pyramid (LAP) is introduced to highlight the feature representation of small objects gradually while suppressing the background and noise in the shallower feature maps. Extensive experiments on three public datasets demonstrate that the proposed method achieves superior performance compared with the state-of-the-art detectors, especially on small object detection datasets. Specifically, our algorithm reaches 94.57% mean average precision (mAP) on the NWPU VHR-10 dataset, 95.93% mAP on the RSOD dataset, and 77.9% mAP on the DIOR dataset.
KW - Local attention pyramid (LAP)
KW - multiscale object detection
KW - remote sensing images (RSIs)
KW - semantic information
KW - small object detection
UR - http://www.scopus.com/inward/record.url?scp=85165271468&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2023.3294241
DO - 10.1109/TGRS.2023.3294241
M3 - 文章
AN - SCOPUS:85165271468
SN - 0196-2892
VL - 61
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5615614
ER -