Source-Free One-Shot Infrared Video Object Segmentation Based on the Segment Anything Model

Jiaqi Chen, Dingwen Zhang, Weinan Zhao, Lei Li, Jun Ren, Hang Qi, Ruitao Lu, Junwei Han

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In recent years, infrared video object segmentation has found extensive applications across various domains, including military surveillance, fire rescue and other fields. Despite the considerable potential of infrared imaging, segmenting objects in infrared video sequences remains a challenging task due to factors such as numerous video frames, limited available data, and complex backgrounds. To address these challenges, we propose a robust solution employing a large model adaptation strategy tailored for infrared datasets, coupled with a one-shot training approach to leverage information across video frames. Our framework, built upon the Segment Anything Model (SAM), effectively extends the parameters of a large model to accommodate infrared images, bridging the gap between training and testing video data and enhancing segmentation performance. The methodology involves supervised and unsupervised training segments, utilizing consistent and contrast loss mechanisms to ensure the model’s robustness and accuracy. Our approach has demonstrated experimentally its capability to effectively migrate parameters trained on visible light to the infrared domain, yielding excellent performance on the infrared dataset VTUAV.

Original languageEnglish
Title of host publicationAdvances in Guidance, Navigation and Control - Proceedings of 2024 International Conference on Guidance, Navigation and Control Volume 16
EditorsLiang Yan, Haibin Duan, Yimin Deng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages534-543
Number of pages10
ISBN (Print)9789819622597
DOIs
StatePublished - 2025
EventInternational Conference on Guidance, Navigation and Control, ICGNC 2024 - Changsha, China
Duration: 9 Aug 202411 Aug 2024

Publication series

NameLecture Notes in Electrical Engineering
Volume1352 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

ConferenceInternational Conference on Guidance, Navigation and Control, ICGNC 2024
Country/TerritoryChina
CityChangsha
Period9/08/2411/08/24

Keywords

  • Infrared Video Object Segmentation
  • Segment Anything Model
  • Source Free

Fingerprint

Dive into the research topics of 'Source-Free One-Shot Infrared Video Object Segmentation Based on the Segment Anything Model'. Together they form a unique fingerprint.

Cite this