TY - GEN
T1 - ADVANCED OBJECT DETECTION IN MULTIBEAM FORWARD-LOOKING SONAR IMAGES USING LINEAR CROSS-ATTENTION TECHNIQUES
AU - Chen, Gangqi
AU - Mao, Zhaoyong
AU - Shen, Junge
N1 - Publisher Copyright:
© 2024 IEEE
PY - 2024
Y1 - 2024
N2 - Sonar images object detection plays a crucial role in marine resource exploration and defense. Existing methods encounter challenges arising from non-rigid, deformed and low-resolution objects in sonar images, making them difficult to build a stable feature representation. In this paper, we present a novel object detection method based on linear cross-attention, aiming to construct a more robust feature representation tailored for sonar objects. Specifically, we introduce a novel feature fusion network designed to efficiently extract global object context. It helps to construct a more robust feature representation, effectively improving the model's ability to detect non-rigidly deformed and low-resolution objects. Moreover, we propose a linear attention mechanism to constitute the linear cross-attention module, leading to a significant reduction in computational load. Extensive experiments conducted on the public available dataset demonstrate the superiority of our approach. Our method surpasses a 1.4 mAP over the baseline, while requiring comparable parameters.
AB - Sonar images object detection plays a crucial role in marine resource exploration and defense. Existing methods encounter challenges arising from non-rigid, deformed and low-resolution objects in sonar images, making them difficult to build a stable feature representation. In this paper, we present a novel object detection method based on linear cross-attention, aiming to construct a more robust feature representation tailored for sonar objects. Specifically, we introduce a novel feature fusion network designed to efficiently extract global object context. It helps to construct a more robust feature representation, effectively improving the model's ability to detect non-rigidly deformed and low-resolution objects. Moreover, we propose a linear attention mechanism to constitute the linear cross-attention module, leading to a significant reduction in computational load. Extensive experiments conducted on the public available dataset demonstrate the superiority of our approach. Our method surpasses a 1.4 mAP over the baseline, while requiring comparable parameters.
KW - feature fusion
KW - linear attention mechanism
KW - Multibeam forward-looking sonar images
KW - transformer
KW - underwater object detection
UR - http://www.scopus.com/inward/record.url?scp=85216835241&partnerID=8YFLogxK
U2 - 10.1109/ICIP51287.2024.10647408
DO - 10.1109/ICIP51287.2024.10647408
M3 - 会议稿件
AN - SCOPUS:85216835241
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 1026
EP - 1031
BT - 2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
PB - IEEE Computer Society
T2 - 31st IEEE International Conference on Image Processing, ICIP 2024
Y2 - 27 October 2024 through 30 October 2024
ER -