Self-Supervised Cross-Modal Distillation for Thermal Infrared Tracking

Yufei Zha, Jingxian Sun, Peng Zhang, Lichao Zhang, Abel Gonzalez-Garcia, Wei Huang

Research output: Contribution to journal, Article, peer-reviewed

4 citations (Scopus)

Abstract

Target representations play an important role in improving the performance of Thermal Infrared (TIR) tracking. To obtain better TIR representations, we propose a Cross-Modal Distillation method that distills representations of the TIR modality from the RGB modality. The distillation is conducted on a large amount of unlabeled paired RGB-TIR data in a self-supervised way. Benefiting from the powerful model in the RGB modality, the cross-modal distillation can learn a TIR-specific representation that promotes TIR tracking. The proposed approach can be conveniently incorporated into different baseline trackers as a generic and independent component. In practice, three different approaches are explored to generate paired RGB-TIR patches with the same semantics for self-supervised training. The method is easy to extend to an even larger scale of unlabeled training data. Our tracker outperforms the baseline tracker, achieving absolute gains of 2.3% in Success Rate, 2.7% in Precision, and 2.5% in Norm Precision on published datasets.
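The distillation idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the networks or the loss, so the linear feature maps, the mean-squared-error objective, the hand-written patch values, and all function names below are assumptions chosen only to show a frozen RGB teacher supervising a TIR student on paired patches without labels.

```python
def linear_features(weights, patch):
    """Project a flattened patch to a feature vector (assumed toy encoder)."""
    return [sum(w * x for w, x in zip(row, patch)) for row in weights]


def distillation_loss(student_feat, teacher_feat):
    """Mean squared error between student (TIR) and teacher (RGB) features."""
    n = len(teacher_feat)
    return sum((s - t) ** 2 for s, t in zip(student_feat, teacher_feat)) / n


def mean_loss(student_w, teacher_w, pairs):
    """Average distillation loss over all paired RGB-TIR patches."""
    total = 0.0
    for rgb_patch, tir_patch in pairs:
        target = linear_features(teacher_w, rgb_patch)
        pred = linear_features(student_w, tir_patch)
        total += distillation_loss(pred, target)
    return total / len(pairs)


def distill_step(student_w, teacher_w, rgb_patch, tir_patch, lr=0.5):
    """One gradient step on the student only; the teacher stays frozen."""
    target = linear_features(teacher_w, rgb_patch)  # no labels needed
    pred = linear_features(student_w, tir_patch)
    n = len(target)
    for i, row in enumerate(student_w):
        grad_scale = 2.0 * (pred[i] - target[i]) / n  # d(MSE)/d(pred_i)
        for j in range(len(row)):
            row[j] -= lr * grad_scale * tir_patch[j]


def run_demo(epochs=300):
    """Train a toy TIR student against a fixed RGB teacher; return losses."""
    # Hypothetical frozen RGB teacher (2 features from 4-pixel patches).
    teacher_w = [[0.5, -0.2, 0.1, 0.3], [0.0, 0.4, -0.3, 0.2]]
    # Student starts from zero and sees only the TIR patch of each pair.
    student_w = [[0.0] * 4 for _ in range(2)]
    # Made-up (rgb_patch, tir_patch) pairs standing in for aligned crops.
    pairs = [
        ([0.9, 0.1, 0.4, 0.7], [0.2, 0.8, 0.5, 0.3]),
        ([0.3, 0.6, 0.8, 0.2], [0.7, 0.1, 0.6, 0.9]),
        ([0.5, 0.5, 0.2, 0.9], [0.4, 0.5, 0.1, 0.6]),
    ]
    before = mean_loss(student_w, teacher_w, pairs)
    for _ in range(epochs):
        for rgb_patch, tir_patch in pairs:
            distill_step(student_w, teacher_w, rgb_patch, tir_patch)
    after = mean_loss(student_w, teacher_w, pairs)
    return before, after


if __name__ == "__main__":
    before, after = run_demo()
    print("loss before:", before, "loss after:", after)
```

The only supervision signal here is the teacher's output on the RGB half of each pair, which is why the scheme needs paired but unlabeled data. A real tracker would replace the linear maps with deep backbones and the toy pairs with the patch-generation approaches the abstract mentions.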

Original language: English
Pages (from-to): 80-96
Number of pages: 17
Journal: IEEE Multimedia
Volume: 29
Issue number: 4
Publication status: Published - 1 Oct 2022
