Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling

Zixiang Zhao; Shuang Xu; Jiangshe Zhang; Chengyang Liang; Chunxia Zhang; Junmin Liu

doi:10.1109/TCSVT.2021.3075745

Xi'an Jiaotong University

Research output: Contribution to journal › Article › Peer-reviewed

151 citations (Scopus)

Abstract

Infrared and visible image fusion (IVIF) expects to obtain images that retain thermal radiation information from infrared images and texture details from visible images. In this paper, a model-based convolutional neural network (CNN) model, referred to as Algorithm Unrolling Image Fusion (AUIF), is proposed to overcome the shortcomings of traditional CNN-based IVIF models. The proposed AUIF model starts with the iterative formulas of two traditional optimization models, which are established to accomplish two-scale decomposition, i.e., separating low-frequency base information and high-frequency detail information from source images. Then the algorithm unrolling is implemented where each iteration is mapped to a CNN layer and each optimization model is transformed into a trainable neural network. Compared with the general network architectures, the proposed framework combines the model-based prior information and is designed more reasonably. After the unrolling operation, our model contains two decomposers (encoders) and an additional reconstructor (decoder). In the training phase, this network is trained to reconstruct the input image. While in the test phase, the base (or detail) decomposed feature maps of infrared/visible images are merged respectively by an extra fusion layer, and then the decoder outputs the fusion image. Qualitative and quantitative comparisons demonstrate the superiority of our model, which can robustly generate fusion images containing highlight targets and legible details, exceeding the state-of-the-art methods. Furthermore, our network has fewer weights and faster speed.
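The pipeline described in the abstract (two-scale decomposition by a fixed number of unrolled iterations, then fusion of base and detail parts) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the quadratic smoothing objective, the fixed step sizes `steps` and weight `lam`, the averaging and max-absolute fusion rules, and the function names `laplacian`, `unrolled_base`, `decompose`, and `fuse`. In AUIF the per-layer operators are trained CNN layers and reconstruction is done by a learned decoder, whereas this sketch uses hand-set values and a plain sum.

```python
import numpy as np


def laplacian(x):
    """5-point Laplacian with replicated (edge) borders."""
    p = np.pad(x, 1, mode="edge")
    return p[:-2, 1:-1] + p[2:, 1:-1] + p[1:-1, :-2] + p[1:-1, 2:] - 4.0 * x


def unrolled_base(img, steps=(0.2, 0.2, 0.2, 0.2), lam=1.0):
    """Estimate a low-frequency base layer with a fixed number of gradient steps.

    Assumed objective (illustrative, not the paper's):
        0.5 * ||B - I||^2 + 0.5 * lam * ||grad B||^2
    Each entry of `steps` plays the role of one unrolled "layer". In a trained
    unrolled network these step sizes and filters would be learned; here they
    are fixed by hand.
    """
    img = np.asarray(img, dtype=float)
    base = img.copy()
    for eta in steps:
        # Gradient of the assumed objective with respect to the base layer.
        grad = (base - img) - lam * laplacian(base)
        base = base - eta * grad
    return base


def decompose(img, **kwargs):
    """Split an image into a base part and a detail part (img = base + detail)."""
    img = np.asarray(img, dtype=float)
    base = unrolled_base(img, **kwargs)
    return base, img - base


def fuse(ir, vis, **kwargs):
    """Fuse two registered grayscale images of equal shape.

    Assumed fusion rules: average the base parts, keep the detail value with
    the larger magnitude. Reconstruction is a plain sum, standing in for the
    learned decoder.
    """
    b_ir, d_ir = decompose(ir, **kwargs)
    b_vis, d_vis = decompose(vis, **kwargs)
    base = 0.5 * (b_ir + b_vis)
    detail = np.where(np.abs(d_ir) >= np.abs(d_vis), d_ir, d_vis)
    return base + detail
```

Calling `fuse(ir, vis)` on two float arrays of the same shape returns an array of that shape. With `lam=1.0`, step sizes of 0.2 keep the iteration stable; larger values of either would need smaller steps.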

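Original language	English
Pages (from-to)	1186-1196
Number of pages	11
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	32
Issue number	3
DOI	https://doi.org/10.1109/TCSVT.2021.3075745
Publication status	Published - 1 Mar 2022
Externally published	Yes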

Cite this

@article{6bbaed2cd8a34b32ae2f82e6542b146b,

title = "Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling",

abstract = "Infrared and visible image fusion (IVIF) expects to obtain images that retain thermal radiation information from infrared images and texture details from visible images. In this paper, a model-based convolutional neural network (CNN) model, referred to as Algorithm Unrolling Image Fusion (AUIF), is proposed to overcome the shortcomings of traditional CNN-based IVIF models. The proposed AUIF model starts with the iterative formulas of two traditional optimization models, which are established to accomplish two-scale decomposition, i.e., separating low-frequency base information and high-frequency detail information from source images. Then the algorithm unrolling is implemented where each iteration is mapped to a CNN layer and each optimization model is transformed into a trainable neural network. Compared with the general network architectures, the proposed framework combines the model-based prior information and is designed more reasonably. After the unrolling operation, our model contains two decomposers (encoders) and an additional reconstructor (decoder). In the training phase, this network is trained to reconstruct the input image. While in the test phase, the base (or detail) decomposed feature maps of infrared/visible images are merged respectively by an extra fusion layer, and then the decoder outputs the fusion image. Qualitative and quantitative comparisons demonstrate the superiority of our model, which can robustly generate fusion images containing highlight targets and legible details, exceeding the state-of-the-art methods. Furthermore, our network has fewer weights and faster speed.",

keywords = "Image fusion, algorithm unrolling, model-based network structure, two-scale decomposition",

author = "Zixiang Zhao and Shuang Xu and Jiangshe Zhang and Chengyang Liang and Chunxia Zhang and Junmin Liu",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2022",

month = mar,

day = "1",

doi = "10.1109/TCSVT.2021.3075745",

language = "English",

volume = "32",

pages = "1186--1196",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "3",

}

TY - JOUR

T1 - Efficient and Model-Based Infrared and Visible Image Fusion via Algorithm Unrolling

AU - Zhao, Zixiang

AU - Xu, Shuang

AU - Zhang, Jiangshe

AU - Liang, Chengyang

AU - Zhang, Chunxia

AU - Liu, Junmin

PY - 2022/3/1

Y1 - 2022/3/1

N2 - Infrared and visible image fusion (IVIF) expects to obtain images that retain thermal radiation information from infrared images and texture details from visible images. In this paper, a model-based convolutional neural network (CNN) model, referred to as Algorithm Unrolling Image Fusion (AUIF), is proposed to overcome the shortcomings of traditional CNN-based IVIF models. The proposed AUIF model starts with the iterative formulas of two traditional optimization models, which are established to accomplish two-scale decomposition, i.e., separating low-frequency base information and high-frequency detail information from source images. Then the algorithm unrolling is implemented where each iteration is mapped to a CNN layer and each optimization model is transformed into a trainable neural network. Compared with the general network architectures, the proposed framework combines the model-based prior information and is designed more reasonably. After the unrolling operation, our model contains two decomposers (encoders) and an additional reconstructor (decoder). In the training phase, this network is trained to reconstruct the input image. While in the test phase, the base (or detail) decomposed feature maps of infrared/visible images are merged respectively by an extra fusion layer, and then the decoder outputs the fusion image. Qualitative and quantitative comparisons demonstrate the superiority of our model, which can robustly generate fusion images containing highlight targets and legible details, exceeding the state-of-the-art methods. Furthermore, our network has fewer weights and faster speed.

AB - Infrared and visible image fusion (IVIF) expects to obtain images that retain thermal radiation information from infrared images and texture details from visible images. In this paper, a model-based convolutional neural network (CNN) model, referred to as Algorithm Unrolling Image Fusion (AUIF), is proposed to overcome the shortcomings of traditional CNN-based IVIF models. The proposed AUIF model starts with the iterative formulas of two traditional optimization models, which are established to accomplish two-scale decomposition, i.e., separating low-frequency base information and high-frequency detail information from source images. Then the algorithm unrolling is implemented where each iteration is mapped to a CNN layer and each optimization model is transformed into a trainable neural network. Compared with the general network architectures, the proposed framework combines the model-based prior information and is designed more reasonably. After the unrolling operation, our model contains two decomposers (encoders) and an additional reconstructor (decoder). In the training phase, this network is trained to reconstruct the input image. While in the test phase, the base (or detail) decomposed feature maps of infrared/visible images are merged respectively by an extra fusion layer, and then the decoder outputs the fusion image. Qualitative and quantitative comparisons demonstrate the superiority of our model, which can robustly generate fusion images containing highlight targets and legible details, exceeding the state-of-the-art methods. Furthermore, our network has fewer weights and faster speed.

KW - Image fusion

KW - algorithm unrolling

KW - model-based network structure

KW - two-scale decomposition

UR - http://www.scopus.com/inward/record.url?scp=85105045378&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2021.3075745

DO - 10.1109/TCSVT.2021.3075745

M3 - Article

AN - SCOPUS:85105045378

SN - 1051-8215

VL - 32

SP - 1186

EP - 1196

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 3

ER -
