跳到主要导航 跳到搜索 跳到主要内容

Multimodal Dual-domain Learning for Image Fusion

  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Multimodal image fusion aims to generate high-resolution hyperspectral images by leveraging the complementary characteristics of spatially and spectrally high-resolution data. However, most existing approaches focus solely on fusion in a spatial domain, while neglecting the potential of frequency-domain information. To address this limitation, this paper proposes a dual-domain learning network that effectively integrates multimodal information from both spatial and frequency domains. In order to explore spatial and frequency domain information, a core module is customized for image fusion, which called the dual-domain fusion module. It consists of two branches that are the spatial domain branch and the frequency domain branch. In the frequency domain branch, the phase and amplitude information of different modes are explored to achieve multi-modal information fusion in the frequency domain. The fusion of dual-domain information helps the model to mine richer context information and improve the detailed reasoning ability of the multi-modal fusion model. Experimental results on two public datasets show that the performance of the proposed network is better than those of other peers.

源语言英语
主期刊名Proceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025
出版商Institute of Electrical and Electronics Engineers Inc.
6545-6554
页数10
ISBN(电子版)9798331589882
DOI
出版状态已出版 - 2025
活动2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025 - Honolulu, 美国
期限: 19 10月 202520 10月 2025

出版系列

姓名Proceedings - 2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025

会议

会议2025 IEEE/CVF International Conference on Computer Vision Workshops, ICCV-W 2025
国家/地区美国
Honolulu
时期19/10/2520/10/25

指纹

探究 'Multimodal Dual-domain Learning for Image Fusion' 的科研主题。它们共同构成独一无二的指纹。

引用此