TY - JOUR
T1 - HMF-Former: Spatio-Spectral Transformer for Hyperspectral and Multispectral Image Fusion
T2 - IEEE Geoscience and Remote Sensing Letters
AU - You, Tengfei
AU - Wu, Chanyue
AU - Bai, Yunpeng
AU - Wang, Dong
AU - Ge, Huibin
AU - Li, Ying
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2023
Y1 - 2023
AB - The key to hyperspectral image (HSI) and multispectral image (MSI) fusion is to exploit the interspectral self-similarity of HSIs and the spatial correlations of MSIs. However, leading convolutional neural network (CNN)-based methods fall short in capturing long-range dependencies and self-similarity priors. To this end, we propose a simple yet efficient Transformer-based network, the hyperspectral and multispectral image fusion (HMF)-Former, for HSI/MSI fusion. The HMF-Former adopts a U-shaped architecture with a spatio-spectral Transformer block (SSTB) as the basic unit. In the SSTB, embedded spatial-wise multihead self-attention (Spa-MSA) and spectral-wise multihead self-attention (Spe-MSA) effectively capture interactions among spatial regions and interspectral dependencies, respectively, consistent with the spatial correlations of MSIs and the interspectral self-similarity of HSIs. In addition, the specially designed SSTB enables the HMF-Former to capture both local and global features while maintaining linear complexity. Extensive experiments on four benchmark datasets show that our method significantly outperforms state-of-the-art methods.
KW - Hyperspectral image (HSI) and multispectral image (MSI) fusion
KW - multihead self-attention (MSA)
KW - remote sensing
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85144786324&partnerID=8YFLogxK
DO - 10.1109/LGRS.2022.3229692
M3 - Article
AN - SCOPUS:85144786324
SN - 1545-598X
VL - 20
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
M1 - 5500505
ER -