TY - GEN
T1 - Infrared and Visible Image Fusion Using Ternary Cycle-Consistent Adversarial Networks
AU - Ge, Kaiyang
AU - Wang, Xue
AU - Han, Shuaiteng
AU - Zhou, Guoqing
AU - Wang, Qing
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Infrared and visible image fusion (IVIF) integrates complementary information from distinct spectral bands to augment image quality and scene understanding, such as object detection. Exsiting methods generally assume identical modality availability during training and testing. To achieve accurate and robust object detection in real-world applications, it is necessary to consider modality drop scenarioes. This paper proposes a novel framework leveraging triadic training data to establish dual bidirectional mappings between source modalities and the fusion domain. The learned mappings include two core generative paths for synthesizing fused images from infrared/visible inputs (infrared → fused, visible → fused), and two auxiliary reconstruciton paths enforcing semantic consistency through inverse translations (infrared ← fused, visible ← fused). To address the under-constraint issue of these mappings across infrared and visible modalities, except for the adversarial loss, we introduce: (i) the ternary cycle-consitency loss enforcing mutual coherence among the dual bidirectional mappings; and (ii) the hybrid supervision loss combining a fusion loss ensuring pixel-wise fidelity to ground truth and a reconstruction loss regularizing auxiliary mappings. To evaluate the performance of the proposed method, we constructed a novel dataset for IVIF and object detection, named DroneCar, which is collected based on an unmanned aerial vehicle (UAV) platform. Experimental results on both DroneCar and three public datasets demonstrate that the proposed method outperforms existing state-of-the-art approaches, especially improving the downstream object detection accuracy of unimodal networks when compared to modality fusion methods across multiple IVIF datasets.
AB - Infrared and visible image fusion (IVIF) integrates complementary information from distinct spectral bands to augment image quality and scene understanding, such as object detection. Exsiting methods generally assume identical modality availability during training and testing. To achieve accurate and robust object detection in real-world applications, it is necessary to consider modality drop scenarioes. This paper proposes a novel framework leveraging triadic training data to establish dual bidirectional mappings between source modalities and the fusion domain. The learned mappings include two core generative paths for synthesizing fused images from infrared/visible inputs (infrared → fused, visible → fused), and two auxiliary reconstruciton paths enforcing semantic consistency through inverse translations (infrared ← fused, visible ← fused). To address the under-constraint issue of these mappings across infrared and visible modalities, except for the adversarial loss, we introduce: (i) the ternary cycle-consitency loss enforcing mutual coherence among the dual bidirectional mappings; and (ii) the hybrid supervision loss combining a fusion loss ensuring pixel-wise fidelity to ground truth and a reconstruction loss regularizing auxiliary mappings. To evaluate the performance of the proposed method, we constructed a novel dataset for IVIF and object detection, named DroneCar, which is collected based on an unmanned aerial vehicle (UAV) platform. Experimental results on both DroneCar and three public datasets demonstrate that the proposed method outperforms existing state-of-the-art approaches, especially improving the downstream object detection accuracy of unimodal networks when compared to modality fusion methods across multiple IVIF datasets.
KW - Generative Adversarial Network
KW - Image Fusion
KW - Modality Drop
KW - Multimodal Learning
KW - Object Detection
UR - https://www.scopus.com/pages/publications/105035368240
U2 - 10.1109/ICVRV67992.2025.00124
DO - 10.1109/ICVRV67992.2025.00124
M3 - 会议稿件
AN - SCOPUS:105035368240
T3 - Proceedings - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
SP - 697
EP - 702
BT - Proceedings - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
Y2 - 19 December 2025 through 21 December 2025
ER -