TY - GEN
T1 - Source-Free One-Shot Infrared Video Object Segmentation Based on the Segment Anything Model
AU - Chen, Jiaqi
AU - Zhang, Dingwen
AU - Zhao, Weinan
AU - Li, Lei
AU - Ren, Jun
AU - Qi, Hang
AU - Lu, Ruitao
AU - Han, Junwei
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
PY - 2025
Y1 - 2025
N2 - In recent years, infrared video object segmentation has found extensive applications across various domains, including military surveillance, fire rescue and other fields. Despite the considerable potential of infrared imaging, segmenting objects in infrared video sequences remains a challenging task due to factors such as numerous video frames, limited available data, and complex backgrounds. To address these challenges, we propose a robust solution employing a large model adaptation strategy tailored for infrared datasets, coupled with a one-shot training approach to leverage information across video frames. Our framework, built upon the Segment Anything Model (SAM), effectively extends the parameters of a large model to accommodate infrared images, bridging the gap between training and testing video data and enhancing segmentation performance. The methodology involves supervised and unsupervised training segments, utilizing consistent and contrast loss mechanisms to ensure the model’s robustness and accuracy. Our approach has demonstrated experimentally its capability to effectively migrate parameters trained on visible light to the infrared domain, yielding excellent performance on the infrared dataset VTUAV.
AB - In recent years, infrared video object segmentation has found extensive applications across various domains, including military surveillance, fire rescue and other fields. Despite the considerable potential of infrared imaging, segmenting objects in infrared video sequences remains a challenging task due to factors such as numerous video frames, limited available data, and complex backgrounds. To address these challenges, we propose a robust solution employing a large model adaptation strategy tailored for infrared datasets, coupled with a one-shot training approach to leverage information across video frames. Our framework, built upon the Segment Anything Model (SAM), effectively extends the parameters of a large model to accommodate infrared images, bridging the gap between training and testing video data and enhancing segmentation performance. The methodology involves supervised and unsupervised training segments, utilizing consistent and contrast loss mechanisms to ensure the model’s robustness and accuracy. Our approach has demonstrated experimentally its capability to effectively migrate parameters trained on visible light to the infrared domain, yielding excellent performance on the infrared dataset VTUAV.
KW - Infrared Video Object Segmentation
KW - Segment Anything Model
KW - Source Free
UR - http://www.scopus.com/inward/record.url?scp=105000347413&partnerID=8YFLogxK
U2 - 10.1007/978-981-96-2260-3_52
DO - 10.1007/978-981-96-2260-3_52
M3 - 会议稿件
AN - SCOPUS:105000347413
SN - 9789819622597
T3 - Lecture Notes in Electrical Engineering
SP - 534
EP - 543
BT - Advances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 16
A2 - Yan, Liang
A2 - Duan, Haibin
A2 - Deng, Yimin
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Guidance, Navigation and Control, ICGNC 2024
Y2 - 9 August 2024 through 11 August 2024
ER -