Unidirectional Cross-Modal Fusion for RGB-T Tracking

Xiao Guo, Hangfei Li, Yufei Zha, Peng Zhang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The key issue of RGB-T tracking is to obtain an effective multimodal representation of targets by utilizing complementary RGB and TIR modality information. Previous methods of template fusion or bidirectional search-template interaction potentially diminish the target representation, resulting from noise information of both templates and search regions. Meanwhile, the direct fusion of sole search features without interacting with templates cannot fully utilize target-relevant contextual information. To mitigate these issues, we present UCTrack, which fuses complementary multimodal search features conditioned on undisturbed RGB and TIR template features. Specifically, we design a Unidirectional Cross-modal Fusion (UCF) module to effectively minimize the influence of background noise on templates by pruning the unnecessary template-to-search cross-modal interaction and to mutually enhance RGB and TIR search features with target-relevant information through multimodal spatial fusion. Furthermore, this module is seamlessly integrated into different layers of a ViT backbone to facilitate feature extraction and cross-modal fusion for RGB-T tracking. Benefiting from the UCF module, UCTrack can effectively and accurately represent multimodal target features without unnecessary template-to-search interaction flow and direct template fusion, making the first proposal of unidirectional cross-modal fusion paradigm for RGB-T tracking to our best knowledge. Extensive experiments on three popular RGB-T tracking benchmarks demonstrate that our method achieves state-of-the-art performance.

源语言英语
主期刊名ECAI 2024 - 27th European Conference on Artificial Intelligence, Including 13th Conference on Prestigious Applications of Intelligent Systems, PAIS 2024, Proceedings
编辑Ulle Endriss, Francisco S. Melo, Kerstin Bach, Alberto Bugarin-Diz, Jose M. Alonso-Moral, Senen Barro, Fredrik Heintz
出版商IOS Press BV
490-497
页数8
ISBN(电子版)9781643685489
DOI
出版状态已出版 - 16 10月 2024
活动27th European Conference on Artificial Intelligence, ECAI 2024 - Santiago de Compostela, 西班牙
期限: 19 10月 202424 10月 2024

出版系列

姓名Frontiers in Artificial Intelligence and Applications
392
ISSN(印刷版)0922-6389
ISSN(电子版)1879-8314

会议

会议27th European Conference on Artificial Intelligence, ECAI 2024
国家/地区西班牙
Santiago de Compostela
时期19/10/2424/10/24

指纹

探究 'Unidirectional Cross-Modal Fusion for RGB-T Tracking' 的科研主题。它们共同构成独一无二的指纹。

引用此