WAFP-Net: Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution

Xibin Song; Dingfu Zhou; Wei Li; Yuchao Dai; Liu Liu; Hongdong Li; Ruigang Yang; Liangjun Zhang

doi:10.1109/TMM.2021.3118282

WAFP-Net: Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution

Xibin Song, Dingfu Zhou, Wei Li, Yuchao Dai, Liu Liu, Hongdong Li, Ruigang Yang, Liangjun Zhang

School of Electronics and Information

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Despite the remarkable progresses achieved in depth map super-resolution (DSR), it remains a major challenge to tackle with real-world degradation of low-resolution (LR) depth maps. Synthetic datasets are mainly used in existing DSR approaches, which is quite different from what would get from a real depth sensor. Besides, the enhancements of features in existing DSR approaches are not sufficiently enough, which also limit the performance. To alleviate these problems, we first propose two types of degradation models to describe the generation of LR depth maps, including bi-cubic down-sampling with noise and interval down-sampling, and different DSR models are learned correspondingly. Then, we propose a weighted attention fusion strategy that is embedded into a progressive residual learning framework, which guarantees that the high-resolution (HR) depth maps can be well recovered in a coarse-To-fine manner. The weighted attention fusion strategy can enhance the features with abundant high-frequency components in both global and local manners, thus better HR depth maps can be expected. Besides, to re-use the effective information in the progressive process sufficiently, a multi-stage fusion module is combined into the proposed framework, and the Total Generalized Variation (TGV) regularization and input loss are exploited to further improve the performance of our method. Extensive experiments of different benchmarks demonstrate the superiority of our approach over the state-of-The-Art (SOTA) approaches.

Original language	English
Pages (from-to)	4113-4127
Number of pages	15
Journal	IEEE Transactions on Multimedia
Volume	24
DOIs	https://doi.org/10.1109/TMM.2021.3118282
State	Published - 2022

Keywords

Attention fusion
depth
residual learning
super-resolution

Access to Document

10.1109/TMM.2021.3118282

Cite this

@article{c1abc6f5029a4f538de238215d111887,

title = "WAFP-Net: Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution",

abstract = "Despite the remarkable progresses achieved in depth map super-resolution (DSR), it remains a major challenge to tackle with real-world degradation of low-resolution (LR) depth maps. Synthetic datasets are mainly used in existing DSR approaches, which is quite different from what would get from a real depth sensor. Besides, the enhancements of features in existing DSR approaches are not sufficiently enough, which also limit the performance. To alleviate these problems, we first propose two types of degradation models to describe the generation of LR depth maps, including bi-cubic down-sampling with noise and interval down-sampling, and different DSR models are learned correspondingly. Then, we propose a weighted attention fusion strategy that is embedded into a progressive residual learning framework, which guarantees that the high-resolution (HR) depth maps can be well recovered in a coarse-To-fine manner. The weighted attention fusion strategy can enhance the features with abundant high-frequency components in both global and local manners, thus better HR depth maps can be expected. Besides, to re-use the effective information in the progressive process sufficiently, a multi-stage fusion module is combined into the proposed framework, and the Total Generalized Variation (TGV) regularization and input loss are exploited to further improve the performance of our method. Extensive experiments of different benchmarks demonstrate the superiority of our approach over the state-of-The-Art (SOTA) approaches.",

keywords = "Attention fusion, depth, residual learning, super-resolution",

author = "Xibin Song and Dingfu Zhou and Wei Li and Yuchao Dai and Liu Liu and Hongdong Li and Ruigang Yang and Liangjun Zhang",

note = "Publisher Copyright: {\textcopyright} 1999-2012 IEEE.",

year = "2022",

doi = "10.1109/TMM.2021.3118282",

language = "英语",

volume = "24",

pages = "4113--4127",

journal = "IEEE Transactions on Multimedia",

issn = "1520-9210",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - WAFP-Net

T2 - Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution

AU - Song, Xibin

AU - Zhou, Dingfu

AU - Li, Wei

AU - Dai, Yuchao

AU - Liu, Liu

AU - Li, Hongdong

AU - Yang, Ruigang

AU - Zhang, Liangjun

PY - 2022

Y1 - 2022

N2 - Despite the remarkable progresses achieved in depth map super-resolution (DSR), it remains a major challenge to tackle with real-world degradation of low-resolution (LR) depth maps. Synthetic datasets are mainly used in existing DSR approaches, which is quite different from what would get from a real depth sensor. Besides, the enhancements of features in existing DSR approaches are not sufficiently enough, which also limit the performance. To alleviate these problems, we first propose two types of degradation models to describe the generation of LR depth maps, including bi-cubic down-sampling with noise and interval down-sampling, and different DSR models are learned correspondingly. Then, we propose a weighted attention fusion strategy that is embedded into a progressive residual learning framework, which guarantees that the high-resolution (HR) depth maps can be well recovered in a coarse-To-fine manner. The weighted attention fusion strategy can enhance the features with abundant high-frequency components in both global and local manners, thus better HR depth maps can be expected. Besides, to re-use the effective information in the progressive process sufficiently, a multi-stage fusion module is combined into the proposed framework, and the Total Generalized Variation (TGV) regularization and input loss are exploited to further improve the performance of our method. Extensive experiments of different benchmarks demonstrate the superiority of our approach over the state-of-The-Art (SOTA) approaches.

AB - Despite the remarkable progresses achieved in depth map super-resolution (DSR), it remains a major challenge to tackle with real-world degradation of low-resolution (LR) depth maps. Synthetic datasets are mainly used in existing DSR approaches, which is quite different from what would get from a real depth sensor. Besides, the enhancements of features in existing DSR approaches are not sufficiently enough, which also limit the performance. To alleviate these problems, we first propose two types of degradation models to describe the generation of LR depth maps, including bi-cubic down-sampling with noise and interval down-sampling, and different DSR models are learned correspondingly. Then, we propose a weighted attention fusion strategy that is embedded into a progressive residual learning framework, which guarantees that the high-resolution (HR) depth maps can be well recovered in a coarse-To-fine manner. The weighted attention fusion strategy can enhance the features with abundant high-frequency components in both global and local manners, thus better HR depth maps can be expected. Besides, to re-use the effective information in the progressive process sufficiently, a multi-stage fusion module is combined into the proposed framework, and the Total Generalized Variation (TGV) regularization and input loss are exploited to further improve the performance of our method. Extensive experiments of different benchmarks demonstrate the superiority of our approach over the state-of-The-Art (SOTA) approaches.

KW - Attention fusion

KW - depth

KW - residual learning

KW - super-resolution

UR - http://www.scopus.com/inward/record.url?scp=85117092706&partnerID=8YFLogxK

U2 - 10.1109/TMM.2021.3118282

DO - 10.1109/TMM.2021.3118282

M3 - 文章

AN - SCOPUS:85117092706

SN - 1520-9210

VL - 24

SP - 4113

EP - 4127

JO - IEEE Transactions on Multimedia

JF - IEEE Transactions on Multimedia

ER -

WAFP-Net: Weighted Attention Fusion Based Progressive Residual Learning for Depth Map Super-Resolution

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this