CS-ViG-UNet: Infrared small and dim target detection based on cycle shift vision graph convolution network

Jian Lin, Shaoyi Li, Xi Yang, Saisai Niu, Binbin Yan, Zhongjie Meng

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Infrared small and dim target detection benefits from the exploration of correlations among targets, neighboring regions, and the background. However, existing methods that rely on convolutional neural networks and vision transformers cannot effectively capture long-range information correlations within images. To overcome this limitation, this paper proposes CS-ViG-UNet, a framework that introduces vision graph convolution for infrared small and dim target detection. Our framework employs a cyclic shift sparse graph attention mechanism to address the issue of reduced expressive power. Meanwhile, the CS-ViG module is designed to construct an effective graph structure using image patches, thereby capturing feature information relevant to target recognition. On the public datasets Sirst AUG and IRSTD-1K, our method obtained F1 scores of 0.8561 and 0.745, respectively, showing an improvement of 3.15 % and 4.1 % compared to the state-of-the-art methods. On the RTX3090 with TensorRT acceleration, CS-ViG-UNet can process approximately 357 images of size 256 × 256 pixels per second at FP16 precision. For detailed information, please visit our homepage: https://linaom1214.github.io/CSViG-UNet.

Original languageEnglish
Article number124385
JournalExpert Systems with Applications
Volume254
DOIs
StatePublished - 15 Nov 2024

Keywords

  • Infrared small and dim target
  • Infrared target detection
  • U-shape architecture
  • Vision graph network

Fingerprint

Dive into the research topics of 'CS-ViG-UNet: Infrared small and dim target detection based on cycle shift vision graph convolution network'. Together they form a unique fingerprint.

Cite this