TY - JOUR
T1 - MSPFusion
T2 - A feature transformer for multidimensional spectral-polarization image fusion
AU - Tong, Geng
AU - Yao, Xinling
AU - Li, Ben
AU - Fu, Jiaye
AU - Wang, Yan
AU - Hao, Jia
AU - Karim, Shahid
AU - Yu, Yiting
N1 - Publisher Copyright:
© 2025
PY - 2025/5/25
Y1 - 2025/5/25
N2 - Spectral images capture reflection characteristics, texture, and spatial details, while polarimetric images provide insights into surface roughness, morphology, and material structure. Fusing these complementary sensing modalities overcomes the limitations of individual perception techniques, offering a richer understanding of the scene. However, traditional fusion methods are typically designed for dual-modal inputs and lack robustness when handling multimodal data. To address this, we propose MSPFusion, a novel feature transformer network for multidimensional spectral-polarization image fusion. Our approach leverages a Feature Multi-head Self-attention (F-MSA) mechanism in the encoder to enhance feature extraction and introduces an entropy-based fusion layer to adaptively integrate spectral and polarization information by maximizing information content. The F-MSA decoder further refines the fused features to produce high-quality integration and visualization of spectral-polarization data. We constructed a hyperspectral-polarization dataset, NWPU-SP, to validate our method. Experimental results show that MSPFusion achieves state-of-the-art performance, ranking first in the following metrics, i.e., entropy (EN), information (MI), standard deviation (SD), quality of saliency (QS), and quality of gradient (QG). All these metrics show a better preservation of information, along with improved clarity and contrast in the fusion results of our proposed method. Additionally, MSPFusion demonstrates strong robustness in infrared–visible image fusion tasks, proving its adaptability across various scenarios. The source code and NWPU-SP dataset are available at https://github.com/tgg-77/MSPFusion.
AB - Spectral images capture reflection characteristics, texture, and spatial details, while polarimetric images provide insights into surface roughness, morphology, and material structure. Fusing these complementary sensing modalities overcomes the limitations of individual perception techniques, offering a richer understanding of the scene. However, traditional fusion methods are typically designed for dual-modal inputs and lack robustness when handling multimodal data. To address this, we propose MSPFusion, a novel feature transformer network for multidimensional spectral-polarization image fusion. Our approach leverages a Feature Multi-head Self-attention (F-MSA) mechanism in the encoder to enhance feature extraction and introduces an entropy-based fusion layer to adaptively integrate spectral and polarization information by maximizing information content. The F-MSA decoder further refines the fused features to produce high-quality integration and visualization of spectral-polarization data. We constructed a hyperspectral-polarization dataset, NWPU-SP, to validate our method. Experimental results show that MSPFusion achieves state-of-the-art performance, ranking first in the following metrics, i.e., entropy (EN), information (MI), standard deviation (SD), quality of saliency (QS), and quality of gradient (QG). All these metrics show a better preservation of information, along with improved clarity and contrast in the fusion results of our proposed method. Additionally, MSPFusion demonstrates strong robustness in infrared–visible image fusion tasks, proving its adaptability across various scenarios. The source code and NWPU-SP dataset are available at https://github.com/tgg-77/MSPFusion.
KW - Encoder-decoder
KW - Image fusion
KW - Spectral-polarization image
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=86000167694&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2025.127079
DO - 10.1016/j.eswa.2025.127079
M3 - 文章
AN - SCOPUS:86000167694
SN - 0957-4174
VL - 275
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 127079
ER -