跳到主要导航 跳到搜索 跳到主要内容

Infrared and Visible Image Fusion Using Ternary Cycle-Consistent Adversarial Networks

  • Kaiyang Ge
  • , Xue Wang
  • , Shuaiteng Han
  • , Guoqing Zhou
  • , Qing Wang
  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Infrared and visible image fusion (IVIF) integrates complementary information from distinct spectral bands to augment image quality and scene understanding, such as object detection. Exsiting methods generally assume identical modality availability during training and testing. To achieve accurate and robust object detection in real-world applications, it is necessary to consider modality drop scenarioes. This paper proposes a novel framework leveraging triadic training data to establish dual bidirectional mappings between source modalities and the fusion domain. The learned mappings include two core generative paths for synthesizing fused images from infrared/visible inputs (infrared → fused, visible → fused), and two auxiliary reconstruciton paths enforcing semantic consistency through inverse translations (infrared ← fused, visible ← fused). To address the under-constraint issue of these mappings across infrared and visible modalities, except for the adversarial loss, we introduce: (i) the ternary cycle-consitency loss enforcing mutual coherence among the dual bidirectional mappings; and (ii) the hybrid supervision loss combining a fusion loss ensuring pixel-wise fidelity to ground truth and a reconstruction loss regularizing auxiliary mappings. To evaluate the performance of the proposed method, we constructed a novel dataset for IVIF and object detection, named DroneCar, which is collected based on an unmanned aerial vehicle (UAV) platform. Experimental results on both DroneCar and three public datasets demonstrate that the proposed method outperforms existing state-of-the-art approaches, especially improving the downstream object detection accuracy of unimodal networks when compared to modality fusion methods across multiple IVIF datasets.

源语言英语
主期刊名Proceedings - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
出版商Institute of Electrical and Electronics Engineers Inc.
697-702
页数6
ISBN(电子版)9798331556297
DOI
出版状态已出版 - 2025
活动2025 International Conference on Virtual Reality and Visualization, ICVRV 2025 - Bogota, 哥伦比亚
期限: 19 12月 202521 12月 2025

出版系列

姓名Proceedings - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025

会议

会议2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
国家/地区哥伦比亚
Bogota
时期19/12/2521/12/25

指纹

探究 'Infrared and Visible Image Fusion Using Ternary Cycle-Consistent Adversarial Networks' 的科研主题。它们共同构成独一无二的指纹。

引用此