TY - GEN
T1 - MVS2: Deep Unsupervised Multi-View Stereo with Multi-View Symmetry
T2 - 7th International Conference on 3D Vision, 3DV 2019
AU - Dai, Yuchao
AU - Zhu, Zhidong
AU - Rao, Zhibo
AU - Li, Bo
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/9
Y1 - 2019/9
AB - The success of existing deep-learning-based multi-view stereo (MVS) approaches depends heavily on the availability of large-scale supervision in the form of dense depth maps. Such supervision is not always available and tends to hinder the generalization ability of the learned models in never-seen-before scenarios. In this paper, we propose the first unsupervised-learning-based MVS network, which learns multi-view depth maps from the input multi-view images and does not need ground-truth 3D training data. Our network is symmetric in that it predicts depth maps for all views simultaneously, and we enforce cross-view consistency of the multi-view depth maps during both training and testing. Thus, the learned multi-view depth maps naturally comply with the underlying 3D scene geometry. Moreover, our network also learns multi-view occlusion maps, which further improves its robustness in handling real-world occlusions. Experimental results on multiple benchmark datasets demonstrate the effectiveness of our network and its excellent generalization ability.
KW - multi view stereo
KW - multi view symmetry
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85075013801&partnerID=8YFLogxK
U2 - 10.1109/3DV.2019.00010
DO - 10.1109/3DV.2019.00010
M3 - Conference contribution
AN - SCOPUS:85075013801
T3 - Proceedings - 2019 International Conference on 3D Vision, 3DV 2019
SP - 1
EP - 8
BT - Proceedings - 2019 International Conference on 3D Vision, 3DV 2019
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 15 September 2019 through 18 September 2019
ER -