Skip to main navigation Skip to search Skip to main content

CCSFuse: Collaborative Compensation and Selective Fusion for UAV-Based RGB-IR Object Detection

  • Tao Zhang
  • , Ruitao Lu
  • , Xiaogang Yang
  • , Dingwen Zhang
  • , Yansheng Li
  • , Xueli Xie
  • , Yunsong Li
  • Rocket Force University of Engineering
  • Wuhan University
  • Xidian University

Research output: Contribution to journalArticlepeer-review

Abstract

Visible-Infrared (RGB-IR) object detection plays a crucial role in uncrewed aerial vehicle (UAV)-based vision tasks. However, existing methods still suffer from learning bias caused by imbalanced information distribution and inaccurate fusion due to modal conflicts. Inspired by the human multisensory information processing mechanism, we propose a novel “Compensation-Fusion” progressive detection framework, CCSFuse, to fully exploit the complementary relationship between modalities while eliminating conflict interference. Specifically, we design a cross-modal feature compensation (CMFC) module, which establishes intermodal information interaction to achieve mutual complementarity and enhancement during feature extraction, effectively mitigating the issue of imbalanced modal information distribution. Additionally, we introduce an adaptive feature-selection fusion (AFSF) module to address modal conflicts. We employ a cross-modal channel attention to calibrate channel features of different modalities and use a selective fusion strategy to dynamically assess modal importance, thereby achieving adaptive modal fusion. Finally, we validate the effectiveness of CCSFuse on the drone vehicle and LLVIP datasets. The results confirm that CCSFuse significantly improves the efficiency of feature optimization and integration. In UAV-based object detection scenarios, CCSFuse outperforms state-of-the-art methods in both qualitative and quantitative comparisons, particularly for small objects and low-quality modalities.

Original languageEnglish
Article number5400314
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume64
DOIs
StatePublished - 2026

Keywords

  • Channel-wise calibration
  • cross-modal feature compensation (CMFC)
  • image fusion
  • visible-infrared object detection

Fingerprint

Dive into the research topics of 'CCSFuse: Collaborative Compensation and Selective Fusion for UAV-Based RGB-IR Object Detection'. Together they form a unique fingerprint.

Cite this