View Synthesis with Multi-scale Cost Aggregation and Confidence Prior

Qi Wu; Xue Wang; Qing Wang

doi:10.1109/DICTA52665.2021.9647048

View Synthesis with Multi-scale Cost Aggregation and Confidence Prior

Qi Wu, Xue Wang, Qing Wang

School of Computer Science

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper presents a learning-based novel view synthesis (NVS) approach from wide-baseline image pairs. Inspired by prior work, we first predict a depth probability volume which represents the scene structure as a set of depth probability layers (DPLs) within a reference view frustum. To reduce geometric uncertainty in ambiguous regions between input images, a multi-scale cost aggregation network is proposed to generate the DPLs for both input views without supervision. Furthermore, to mitigate the depth discretizaiton artifacts in distant views, we calculate the disparity map of the target view by passing the warped DPLs onto the target view to a CNN-based fusion network. Finally the predicted view could be obtained by incorporating the disparity map, warped input images and the confidence prior together. The proposed method improves the performance on challenging scenarios such as texture-less or non-textured regions, occlusion boundaries, non-Lambertian surfaces, and distant viewpoints. Experimental results show that our method achieves state-of-the-art view interpolation and extrapolation results on RealEstate10K mini dataset.

Original language	English
Title of host publication	DICTA 2021 - 2021 International Conference on Digital Image Computing
Subtitle of host publication	Techniques and Applications
Editors	Jun Zhou, Olivier Salvado, Ferdous Sohel, Paulo Vinicius K. Borges, Shilin Wang
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781665417099
DOIs	https://doi.org/10.1109/DICTA52665.2021.9647048
State	Published - 2021
Event	2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021 - Gold Coast, Australia Duration: 29 Nov 2021 → 1 Dec 2021

Publication series

Name	DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications

Conference

Conference	2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021
Country/Territory	Australia
City	Gold Coast
Period	29/11/21 → 1/12/21

Keywords

Confidence prior
Multi-scale cost aggregation
Sparse views
View synthesis
Wide baseline

Access to Document

10.1109/DICTA52665.2021.9647048

Cite this

Wu, Q., Wang, X., & Wang, Q. (2021). View Synthesis with Multi-scale Cost Aggregation and Confidence Prior. In J. Zhou, O. Salvado, F. Sohel, P. V. K. Borges, & S. Wang (Eds.), DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications (DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/DICTA52665.2021.9647048

Wu, Qi ; Wang, Xue ; Wang, Qing. / View Synthesis with Multi-scale Cost Aggregation and Confidence Prior. DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications. editor / Jun Zhou ; Olivier Salvado ; Ferdous Sohel ; Paulo Vinicius K. Borges ; Shilin Wang. Institute of Electrical and Electronics Engineers Inc., 2021. (DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications).

@inproceedings{9017b933cbf341c6a6676e241c26738a,

title = "View Synthesis with Multi-scale Cost Aggregation and Confidence Prior",

abstract = "This paper presents a learning-based novel view synthesis (NVS) approach from wide-baseline image pairs. Inspired by prior work, we first predict a depth probability volume which represents the scene structure as a set of depth probability layers (DPLs) within a reference view frustum. To reduce geometric uncertainty in ambiguous regions between input images, a multi-scale cost aggregation network is proposed to generate the DPLs for both input views without supervision. Furthermore, to mitigate the depth discretizaiton artifacts in distant views, we calculate the disparity map of the target view by passing the warped DPLs onto the target view to a CNN-based fusion network. Finally the predicted view could be obtained by incorporating the disparity map, warped input images and the confidence prior together. The proposed method improves the performance on challenging scenarios such as texture-less or non-textured regions, occlusion boundaries, non-Lambertian surfaces, and distant viewpoints. Experimental results show that our method achieves state-of-the-art view interpolation and extrapolation results on RealEstate10K mini dataset.",

keywords = "Confidence prior, Multi-scale cost aggregation, Sparse views, View synthesis, Wide baseline",

author = "Qi Wu and Xue Wang and Qing Wang",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021 ; Conference date: 29-11-2021 Through 01-12-2021",

year = "2021",

doi = "10.1109/DICTA52665.2021.9647048",

language = "英语",

series = "DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

editor = "Jun Zhou and Olivier Salvado and Ferdous Sohel and Borges, {Paulo Vinicius K.} and Shilin Wang",

booktitle = "DICTA 2021 - 2021 International Conference on Digital Image Computing",

}

Wu, Q, Wang, X & Wang, Q 2021, View Synthesis with Multi-scale Cost Aggregation and Confidence Prior. in J Zhou, O Salvado, F Sohel, PVK Borges & S Wang (eds), DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications. DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications, Institute of Electrical and Electronics Engineers Inc., 2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021, Gold Coast, Australia, 29/11/21. https://doi.org/10.1109/DICTA52665.2021.9647048

View Synthesis with Multi-scale Cost Aggregation and Confidence Prior. / Wu, Qi; Wang, Xue; Wang, Qing.
DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications. ed. / Jun Zhou; Olivier Salvado; Ferdous Sohel; Paulo Vinicius K. Borges; Shilin Wang. Institute of Electrical and Electronics Engineers Inc., 2021. (DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - View Synthesis with Multi-scale Cost Aggregation and Confidence Prior

AU - Wu, Qi

AU - Wang, Xue

AU - Wang, Qing

PY - 2021

Y1 - 2021

N2 - This paper presents a learning-based novel view synthesis (NVS) approach from wide-baseline image pairs. Inspired by prior work, we first predict a depth probability volume which represents the scene structure as a set of depth probability layers (DPLs) within a reference view frustum. To reduce geometric uncertainty in ambiguous regions between input images, a multi-scale cost aggregation network is proposed to generate the DPLs for both input views without supervision. Furthermore, to mitigate the depth discretizaiton artifacts in distant views, we calculate the disparity map of the target view by passing the warped DPLs onto the target view to a CNN-based fusion network. Finally the predicted view could be obtained by incorporating the disparity map, warped input images and the confidence prior together. The proposed method improves the performance on challenging scenarios such as texture-less or non-textured regions, occlusion boundaries, non-Lambertian surfaces, and distant viewpoints. Experimental results show that our method achieves state-of-the-art view interpolation and extrapolation results on RealEstate10K mini dataset.

AB - This paper presents a learning-based novel view synthesis (NVS) approach from wide-baseline image pairs. Inspired by prior work, we first predict a depth probability volume which represents the scene structure as a set of depth probability layers (DPLs) within a reference view frustum. To reduce geometric uncertainty in ambiguous regions between input images, a multi-scale cost aggregation network is proposed to generate the DPLs for both input views without supervision. Furthermore, to mitigate the depth discretizaiton artifacts in distant views, we calculate the disparity map of the target view by passing the warped DPLs onto the target view to a CNN-based fusion network. Finally the predicted view could be obtained by incorporating the disparity map, warped input images and the confidence prior together. The proposed method improves the performance on challenging scenarios such as texture-less or non-textured regions, occlusion boundaries, non-Lambertian surfaces, and distant viewpoints. Experimental results show that our method achieves state-of-the-art view interpolation and extrapolation results on RealEstate10K mini dataset.

KW - Confidence prior

KW - Multi-scale cost aggregation

KW - Sparse views

KW - View synthesis

KW - Wide baseline

UR - http://www.scopus.com/inward/record.url?scp=85124285874&partnerID=8YFLogxK

U2 - 10.1109/DICTA52665.2021.9647048

DO - 10.1109/DICTA52665.2021.9647048

M3 - 会议稿件

AN - SCOPUS:85124285874

T3 - DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications

BT - DICTA 2021 - 2021 International Conference on Digital Image Computing

A2 - Zhou, Jun

A2 - Salvado, Olivier

A2 - Sohel, Ferdous

A2 - Borges, Paulo Vinicius K.

A2 - Wang, Shilin

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021

Y2 - 29 November 2021 through 1 December 2021

ER -

Wu Q, Wang X, Wang Q. View Synthesis with Multi-scale Cost Aggregation and Confidence Prior. In Zhou J, Salvado O, Sohel F, Borges PVK, Wang S, editors, DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications. Institute of Electrical and Electronics Engineers Inc. 2021. (DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications). doi: 10.1109/DICTA52665.2021.9647048

View Synthesis with Multi-scale Cost Aggregation and Confidence Prior

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this