TY - JOUR
T1 - A multi-scale feature cross-dimensional interaction network for stereo image super-resolution
AU - Zhang, Jingcheng
AU - Zhu, Yu
AU - Peng, Shengjun
AU - Niu, Axi
AU - Yan, Qingsen
AU - Sun, Jinqiu
AU - Zhang, Yanning
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2025.
PY - 2025/4
Y1 - 2025/4
N2 - Recently, stereo image super-resolution (SSR) has achieved impressive performance by leveraging both intra-view and inter-view information. However, existing SSR methods often rely on single-scale features for stereo image feature extraction and overlook multi-dimensional feature interactions, resulting in poor visual quality with unclear and insufficiently sharp reconstruction of details. To address these issues and achieve better performance for stereo image super-resolution, we propose a multi-scale feature cross-dimensional interaction network (MFCINet) for SSR. Specifically, to fully exploit intra-view information, we design multi-scale feature extraction blocks to capture abundant multi-scale texture patterns, including the Local Feature Extraction Block (LFEB), Mesoscale Feature Extraction Block (MFEB), and Global Feature Extraction Block (GFEB). We progressively fuse smaller-scale features with larger-scale features, utilizing the local texture information contained in the smaller-scale features to refine the global structure information of the larger-scale features. To explore richer interactions of complementary features, we introduce the Cross-dimensional Attention Interaction Block (CAIB), which calculates attention between complementary features across different spatial positions and channels, facilitating comprehensive interaction among complementary features across various dimensions. Extensive experiments and ablation studies demonstrate that MFCINet better leverages intra-view and inter-view information to reconstruct clear texture details, achieving competitive results and outperforming state-of-the-art methods.
AB - Recently, stereo image super-resolution (SSR) has achieved impressive performance by leveraging both intra-view and inter-view information. However, existing SSR methods often rely on single-scale features for stereo image feature extraction and overlook multi-dimensional feature interactions, resulting in poor visual quality with unclear and insufficiently sharp reconstruction of details. To address these issues and achieve better performance for stereo image super-resolution, we propose a multi-scale feature cross-dimensional interaction network (MFCINet) for SSR. Specifically, to fully exploit intra-view information, we design multi-scale feature extraction blocks to capture abundant multi-scale texture patterns, including the Local Feature Extraction Block (LFEB), Mesoscale Feature Extraction Block (MFEB), and Global Feature Extraction Block (GFEB). We progressively fuse smaller-scale features with larger-scale features, utilizing the local texture information contained in the smaller-scale features to refine the global structure information of the larger-scale features. To explore richer interactions of complementary features, we introduce the Cross-dimensional Attention Interaction Block (CAIB), which calculates attention between complementary features across different spatial positions and channels, facilitating comprehensive interaction among complementary features across various dimensions. Extensive experiments and ablation studies demonstrate that MFCINet better leverages intra-view and inter-view information to reconstruct clear texture details, achieving competitive results and outperforming state-of-the-art methods.
KW - Cross-dimensional attention
KW - Feature fusion
KW - Multi-scale
KW - Stereo image super-resolution
UR - http://www.scopus.com/inward/record.url?scp=85219748652&partnerID=8YFLogxK
U2 - 10.1007/s00530-025-01714-8
DO - 10.1007/s00530-025-01714-8
M3 - 文章
AN - SCOPUS:85219748652
SN - 0942-4962
VL - 31
JO - Multimedia Systems
JF - Multimedia Systems
IS - 2
M1 - 114
ER -