TY - JOUR
T1 - IR-TransDet
T2 - Infrared Dim and Small Target Detection with IR-Transformer
AU - Lin, Jian
AU - Li, Shaoyi
AU - Zhang, Liang
AU - Yang, Xi
AU - Yan, Binbin
AU - Meng, Zhongjie
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2023
Y1 - 2023
N2 - Infrared dim and small target detection is one of the crucial technologies in the military field, but it faces various challenges such as weak features and small target scales. To overcome these challenges, this article proposes IR-TransDet, which integrates the benefits of the convolutional neural network (CNN) and the Transformer, to properly extract global semantic information and features of small targets. First, the efficient feature extraction module (EFEM) is designed, which uses depthwise convolution and pointwise convolution (PW Conv) to effectively capture the features of the target. Then, an improved Residual Sim atrous spatial pyramid pooling (ASPP) module is proposed based on the image characteristics of infrared dim and small targets. The proposed method focuses on enhancing the edge information of the target. Meanwhile, an IR-Transformer module is devised, which uses the self-attention mechanism to investigate the relationship between the global image, the target, and neighboring pixels. Finally, experiments were conducted on four open datasets, and the results indicate that IR-TransDet achieves state-of-the-art performance in infrared dim and small target detection. To achieve a comparative evaluation of the existing infrared dim and small target detection methods, this study constructed the ISTD-Benchmark tool, which is available at https://linaom1214.github.io/ISTD-Benchmark.
AB - Infrared dim and small target detection is one of the crucial technologies in the military field, but it faces various challenges such as weak features and small target scales. To overcome these challenges, this article proposes IR-TransDet, which integrates the benefits of the convolutional neural network (CNN) and the Transformer, to properly extract global semantic information and features of small targets. First, the efficient feature extraction module (EFEM) is designed, which uses depthwise convolution and pointwise convolution (PW Conv) to effectively capture the features of the target. Then, an improved Residual Sim atrous spatial pyramid pooling (ASPP) module is proposed based on the image characteristics of infrared dim and small targets. The proposed method focuses on enhancing the edge information of the target. Meanwhile, an IR-Transformer module is devised, which uses the self-attention mechanism to investigate the relationship between the global image, the target, and neighboring pixels. Finally, experiments were conducted on four open datasets, and the results indicate that IR-TransDet achieves state-of-the-art performance in infrared dim and small target detection. To achieve a comparative evaluation of the existing infrared dim and small target detection methods, this study constructed the ISTD-Benchmark tool, which is available at https://linaom1214.github.io/ISTD-Benchmark.
KW - IR-transformer
KW - ISTD-Benchmark tool
KW - Infrared dim and small target detection
KW - Sim atrous spatial pyramid pooling (ASPP)
KW - self-attention mechanism
UR - http://www.scopus.com/inward/record.url?scp=85176369502&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2023.3327317
DO - 10.1109/TGRS.2023.3327317
M3 - 文章
AN - SCOPUS:85176369502
SN - 0196-2892
VL - 61
SP - 1
EP - 13
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5004813
ER -