RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision

Jinzhong Wang, Xuetao Tian, Shun Dai, Tao Zhuo, Haorui Zeng, Hongjuan Liu, Jiaqi Liu, Xiuwei Zhang, Yanning Zhang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Multispectral object detection, utilizing both visible (RGB) and thermal infrared (T) modals, has garnered significant attention for its robust performance across diverse weather and lighting conditions. However, effectively exploiting the complementarity between RGB-T modals while maintaining efficiency remains a critical challenge. In this paper, a very simple Group Shuffled Multi-receptive Attention (GSMA) module is proposed to extract and combine multi-scale RGB and thermal features. Then, the extracted multi-modal features are directly integrated with a multi-level path aggregation neck, which significantly improves the fusion effect and efficiency. Meanwhile, multi-modal object detection often adopts union annotations for both modals. This kind of supervision is not sufficient and unfair, since objects observed in one modal may not be seen in the other modal. To solve this issue, Multi-modal Supervision (MS) is proposed to sufficiently supervise RGB-T object detection. Comprehensive experiments on two challenging benchmarks, KAIST and DroneVehicle, demonstrate the proposed model achieves the state-of-the-art accuracy while maintaining competitive efficiency.

源语言英语
主期刊名Pattern Recognition - 27th International Conference, ICPR 2024, Proceedings
编辑Apostolos Antonacopoulos, Subhasis Chaudhuri, Rama Chellappa, Cheng-Lin Liu, Saumik Bhattacharya, Umapada Pal
出版商Springer Science and Business Media Deutschland GmbH
284-298
页数15
ISBN(印刷版)9783031784460
DOI
出版状态已出版 - 2025
活动27th International Conference on Pattern Recognition, ICPR 2024 - Kolkata, 印度
期限: 1 12月 20245 12月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
15317 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议27th International Conference on Pattern Recognition, ICPR 2024
国家/地区印度
Kolkata
时期1/12/245/12/24

指纹

探究 'RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision' 的科研主题。它们共同构成独一无二的指纹。

引用此