Skip to main navigation Skip to search Skip to main content

Infrared and Visible Image Fusion Model Based on Wavelet-Convolution and Transformer

  • Yanwei Chen
  • , Lihan Zheng
  • , Yapeng Wu
  • , Chen Yang
  • , Haoyan Guo
  • , Wentao Shi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Balancing local details and global structures remains a challenge in dual-spectrum image fusion, which integrates complementary information from infrared (IR) and visible (VI) sources. To address this, we propose WaveTransnet, a novel dual-branch network synergizing wavelet transforms and Transformers. The network processes IR and VI inputs through parallel branches. Within each branch, both wavelet convolutions and Transformer blocks are employed to concurrently extract high-frequency (HF) features capturing fine details and lowfrequency (LF) features representing structural context. An intramodal channel-spatial attention module then adaptively integrates these distinct HF and LF features derived from both the wavelet and Transformer paths within each modality (IR and VI separately), generating enhanced modality-specific HF and LF representations. Subsequently, cross-modal fusion merges the corresponding frequency components: the enhanced HF features from IR and VI are fused, and separately, the enhanced LF features are fused. Finally, the fused HF and LF representations are reconstructed into the final output image. Extensive experiments on the TNO datasets demonstrate that WaveTransnet achieves state-of-the-art performance, surpassing existing methods across multiple objective metrics (including EN, MI, SF, AG, SD, VIF) and subjective visual quality. Notably, the model effectively preserves detailed background textures from VI images while retaining salient thermal targets from IR images, highlighting its strong potential for practical applications by effectively leveraging frequency-specific information from both wavelet and Transformer perspectives.

Original languageEnglish
Title of host publication2025 10th International Symposium on Advances in Electrical, Electronics and Computer Engineering, ISAEECE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages268-272
Number of pages5
ISBN (Electronic)9798331513382
DOIs
StatePublished - 2025
Event10th International Symposium on Advances in Electrical, Electronics and Computer Engineering, ISAEECE 2025 - Xi'an, China
Duration: 20 Jun 202522 Jun 2025

Publication series

Name2025 10th International Symposium on Advances in Electrical, Electronics and Computer Engineering, ISAEECE 2025

Conference

Conference10th International Symposium on Advances in Electrical, Electronics and Computer Engineering, ISAEECE 2025
Country/TerritoryChina
CityXi'an
Period20/06/2522/06/25

Keywords

  • Dual-Branch Architecture
  • Image Fusion
  • Transformer
  • Wavelet Convolution

Fingerprint

Dive into the research topics of 'Infrared and Visible Image Fusion Model Based on Wavelet-Convolution and Transformer'. Together they form a unique fingerprint.

Cite this