跳到主要导航 跳到搜索 跳到主要内容

PLVAM-DETR: Patch-Level Visibility Aware Multi-spectral Detection Transformer with Frequency Specific Fusion

  • Xiuwei Zhang
  • , Haorui Zeng
  • , Xiaoqiang Zhang
  • , Wencong Wu
  • , Hanlin Yin
  • , Shun Dai
  • , Yanning Zhang
  • Northwestern Polytechnical University Xian
  • Southwest University of Science and Technology

科研成果: 期刊稿件文章同行评审

摘要

Multi-spectral object detection demonstrates enhanced robustness compared to single-spectral object detection by leveraging complementary information from both visible and infrared spectra. Current presented methods often fuse multi-spectral information using attention mechanisms, such as spatial/channel attention for CNN-based approaches and cross-attention/self-attention for Transformer-based approaches. While, object visibility across different regions in different spectral images is varying. Therefore, explicit patch-level visibility-awareness is required to perform finer spatial and spectral exploration. Moreover, the distinct frequency characteristics of infrared and visible images are rarely highlighted, hindering effective utilization of their complementary benefits. To overcome these limitations, we propose a patch-level visibility aware multi-spectral detection Transformer with frequency specific fusion, named PLVAM-DETR. A patch-level visibility aware module is designed to dynamically determine the significance across different patches in different spectral images. Then, a frequency specific feature fusion module is presented to highlight high-frequency information in the infrared features and low-frequency information in the visible features, which provides more comprehensive feature for detection. Extensive experiments on publicly available datasets demonstrate competitive results compared to state-of-the-art methods.

源语言英语
期刊IEEE Transactions on Multimedia
DOI
出版状态已接受/待刊 - 2026

指纹

探究 'PLVAM-DETR: Patch-Level Visibility Aware Multi-spectral Detection Transformer with Frequency Specific Fusion' 的科研主题。它们共同构成独一无二的指纹。

引用此