TY - JOUR
T1 - Laplacian Pyramid Fusion Network With Hierarchical Guidance for Infrared and Visible Image Fusion
AU - Yao, Jiaxin
AU - Zhao, Yongqiang
AU - Bu, Yuanyang
AU - Kong, Seong G.
AU - Chan, Jonathan Cheung-Wai
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023/9/1
Y1 - 2023/9/1
N2 - The fusion of infrared and visible images combines the information from two complementary imaging modalities for various computer vision tasks. Many existing techniques, however, fail to maintain a uniform overall style and keep salient details of individual modalities simultaneously. This paper presents an end-to-end Laplacian Pyramid Fusion Network with hierarchical guidance (HG-LPFN) that takes advantage of the pixel-level saliency preservation of the Laplacian Pyramid and the global optimization capability of deep learning. The proposed scheme generates hierarchical saliency maps through Laplacian Pyramid decomposition and modal difference calculation. In the pyramid fusion mode, all sub-networks are connected in a bottom-up manner. The sub-network for low-frequency fusion focuses on extracting universal features to produce a uniform style, while the sub-networks for high-frequency fusion determine how much of the details of each modality will be retained. Taking the style, details, and background into consideration, we design a set of novel loss functions to supervise both low-frequency images and full-resolution images under the guidance of saliency maps. Experimental results on public datasets demonstrate that the proposed HG-LPFN outperforms state-of-the-art image fusion techniques.
KW - deep learning
KW - Infrared and visible image fusion
KW - Laplacian pyramid
UR - http://www.scopus.com/inward/record.url?scp=85149386219&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2023.3245607
DO - 10.1109/TCSVT.2023.3245607
M3 - Article
AN - SCOPUS:85149386219
SN - 1051-8215
VL - 33
SP - 4630
EP - 4644
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 9
ER -