TY - JOUR
T1 - A Unified SAM-Guided Self-Prompt Learning Framework for Infrared Small Target Detection
AU - Fu, Yimin
AU - Lyu, Jialin
AU - Ma, Peiyuan
AU - Liu, Zhunga
AU - Ng, Michael K.
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Infrared small target detection (ISTD) aims to precisely capture the location and morphology of small targets under all-weather conditions. Compared with generic objects, infrared targets in remote fields of view are smaller in size and exhibit lower signal-to-clutter ratios (SCRs). This poses a significant challenge in simultaneously preserving low-level target details and understanding high-level contextual semantics, forcing a tradeoff between reducing miss detection and suppressing false alarms. In addition, most existing ISTD methods are designed for specific target types under certain infrared platforms, rather than as a unified framework broadly applicable across diverse infrared sensing scenarios. To address these challenges, we propose a unified self-prompt learning (SPL) framework for ISTD under the guidance of the segment anything model (SAM). Specifically, the model is incorporated with the SAM in the encoding stage through a consult-guide manner, adapting the general knowledge to facilitate task-specific contextual understanding. Then, shallow-layer features are employed to generate self-derived prompts, which bidirectionally interact with encoded latent representations to complement subtle low-level details. Moreover, the semantic inconsistency during resolution recovery is mitigated by integrating a mutual calibration module into skip connections, ensuring coherent spatial-semantic fusion. Extensive experiments are conducted on four public ISTD datasets, and the results demonstrate that the proposed method consistently achieves superior performance across different infrared sensing platforms and target types. The code is released at https://github.com/fuyimin96/SAM-SPL
AB - Infrared small target detection (ISTD) aims to precisely capture the location and morphology of small targets under all-weather conditions. Compared with generic objects, infrared targets in remote fields of view are smaller in size and exhibit lower signal-to-clutter ratios (SCRs). This poses a significant challenge in simultaneously preserving low-level target details and understanding high-level contextual semantics, forcing a tradeoff between reducing miss detection and suppressing false alarms. In addition, most existing ISTD methods are designed for specific target types under certain infrared platforms, rather than as a unified framework broadly applicable across diverse infrared sensing scenarios. To address these challenges, we propose a unified self-prompt learning (SPL) framework for ISTD under the guidance of the segment anything model (SAM). Specifically, the model is incorporated with the SAM in the encoding stage through a consult-guide manner, adapting the general knowledge to facilitate task-specific contextual understanding. Then, shallow-layer features are employed to generate self-derived prompts, which bidirectionally interact with encoded latent representations to complement subtle low-level details. Moreover, the semantic inconsistency during resolution recovery is mitigated by integrating a mutual calibration module into skip connections, ensuring coherent spatial-semantic fusion. Extensive experiments are conducted on four public ISTD datasets, and the results demonstrate that the proposed method consistently achieves superior performance across different infrared sensing platforms and target types. The code is released at https://github.com/fuyimin96/SAM-SPL
KW - Infrared small target detection (ISTD)
KW - remote sensing
KW - segment anything model (SAM)
KW - self-prompt learning (SPL)
UR - https://www.scopus.com/pages/publications/105016733747
U2 - 10.1109/TGRS.2025.3610919
DO - 10.1109/TGRS.2025.3610919
M3 - 文章
AN - SCOPUS:105016733747
SN - 0196-2892
VL - 63
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5008014
ER -