TY - GEN
T1 - View Synthesis with Multi-scale Cost Aggregation and Confidence Prior
AU - Wu, Qi
AU - Wang, Xue
AU - Wang, Qing
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - This paper presents a learning-based novel view synthesis (NVS) approach from wide-baseline image pairs. Inspired by prior work, we first predict a depth probability volume which represents the scene structure as a set of depth probability layers (DPLs) within a reference view frustum. To reduce geometric uncertainty in ambiguous regions between input images, a multi-scale cost aggregation network is proposed to generate the DPLs for both input views without supervision. Furthermore, to mitigate the depth discretizaiton artifacts in distant views, we calculate the disparity map of the target view by passing the warped DPLs onto the target view to a CNN-based fusion network. Finally the predicted view could be obtained by incorporating the disparity map, warped input images and the confidence prior together. The proposed method improves the performance on challenging scenarios such as texture-less or non-textured regions, occlusion boundaries, non-Lambertian surfaces, and distant viewpoints. Experimental results show that our method achieves state-of-the-art view interpolation and extrapolation results on RealEstate10K mini dataset.
AB - This paper presents a learning-based novel view synthesis (NVS) approach from wide-baseline image pairs. Inspired by prior work, we first predict a depth probability volume which represents the scene structure as a set of depth probability layers (DPLs) within a reference view frustum. To reduce geometric uncertainty in ambiguous regions between input images, a multi-scale cost aggregation network is proposed to generate the DPLs for both input views without supervision. Furthermore, to mitigate the depth discretizaiton artifacts in distant views, we calculate the disparity map of the target view by passing the warped DPLs onto the target view to a CNN-based fusion network. Finally the predicted view could be obtained by incorporating the disparity map, warped input images and the confidence prior together. The proposed method improves the performance on challenging scenarios such as texture-less or non-textured regions, occlusion boundaries, non-Lambertian surfaces, and distant viewpoints. Experimental results show that our method achieves state-of-the-art view interpolation and extrapolation results on RealEstate10K mini dataset.
KW - Confidence prior
KW - Multi-scale cost aggregation
KW - Sparse views
KW - View synthesis
KW - Wide baseline
UR - http://www.scopus.com/inward/record.url?scp=85124285874&partnerID=8YFLogxK
U2 - 10.1109/DICTA52665.2021.9647048
DO - 10.1109/DICTA52665.2021.9647048
M3 - 会议稿件
AN - SCOPUS:85124285874
T3 - DICTA 2021 - 2021 International Conference on Digital Image Computing: Techniques and Applications
BT - DICTA 2021 - 2021 International Conference on Digital Image Computing
A2 - Zhou, Jun
A2 - Salvado, Olivier
A2 - Sohel, Ferdous
A2 - Borges, Paulo Vinicius K.
A2 - Wang, Shilin
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2021
Y2 - 29 November 2021 through 1 December 2021
ER -