TY - JOUR
T1 - Remote Sensing Small Object Detection Based on Multicontextual Information Aggregation
AU - Wang, Jingyu
AU - Ma, Mingrui
AU - Huang, Pengfei
AU - Mei, Shaohui
AU - Zhang, Liang
AU - Wang, Hongmei
N1 - Publisher Copyright:
© 2008-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Due to wide field of view and background confusion, remote sensing objects are small and densely packed, hence commonly used detection methods detecting small objects are not satisfactory. In this article, we propose the multicontextual information aggregation YOLO (MCIA-YOLO) method, combining three novel modules to effectively aggregate multicontextual information across channels, depths, and pixels. First, the channel-spatial information aggregation module assembles spatial global features pursuant to channel contextual information, increasing the density of key information. Second, the shallow-deep information sparse aggregation module applies a sparse cross self-attention mechanism. By sparsely correlating long-range dependency information across different regions, the representation capability of a small target is enhanced while removing redundant information. Third, to enrich local multiscale features and better identify dense targets, multiscale weighted aggregation module convolves multireceptive field information and performs weighted fusion. Our method demonstrates satisfactory performance on dataset VisDrone2019, UAVDT, and NWPU VHR-10, especially in small objects detection, surpassing several state-of-the-art methods.
AB - Due to wide field of view and background confusion, remote sensing objects are small and densely packed, hence commonly used detection methods detecting small objects are not satisfactory. In this article, we propose the multicontextual information aggregation YOLO (MCIA-YOLO) method, combining three novel modules to effectively aggregate multicontextual information across channels, depths, and pixels. First, the channel-spatial information aggregation module assembles spatial global features pursuant to channel contextual information, increasing the density of key information. Second, the shallow-deep information sparse aggregation module applies a sparse cross self-attention mechanism. By sparsely correlating long-range dependency information across different regions, the representation capability of a small target is enhanced while removing redundant information. Third, to enrich local multiscale features and better identify dense targets, multiscale weighted aggregation module convolves multireceptive field information and performs weighted fusion. Our method demonstrates satisfactory performance on dataset VisDrone2019, UAVDT, and NWPU VHR-10, especially in small objects detection, surpassing several state-of-the-art methods.
KW - Global contextual information
KW - multicontextual information
KW - multireceptive field enhancement
KW - small object detection
KW - sparse cross self-attention
UR - http://www.scopus.com/inward/record.url?scp=105001691657&partnerID=8YFLogxK
U2 - 10.1109/JSTARS.2025.3543189
DO - 10.1109/JSTARS.2025.3543189
M3 - 文章
AN - SCOPUS:105001691657
SN - 1939-1404
VL - 18
SP - 8248
EP - 8260
JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
ER -