Novel view synthesis from only a 6-DoF camera pose by two-stage networks

Xiang Guo; Bo Li; Yuchao Dai; Tongxin Zhang; Hui Deng

doi:10.1109/ICPR48806.2021.9413261

School of Electronics and Information

Northwestern Polytechnical University, Xi'an

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Novel view synthesis is a challenging problem in computer vision and robotics. Unlike existing works, which need reference images or 3D models of the scene to generate images under novel views, we propose a new paradigm for this problem: we synthesize the novel view directly from only a 6-DoF camera pose. Although this setting is the most straightforward one, few works address it. Our experiments demonstrate that, with a concise CNN, we can obtain a meaningful parametric model that reconstructs the correct scenery images from the 6-DoF pose alone. To this end, we propose a two-stage learning strategy consisting of two consecutive CNNs: GenNet and RefineNet. GenNet generates a coarse image from a camera pose. RefineNet is a generative adversarial network that refines the coarse image. In this way, we decouple the mapping of geometric relationships from the rendering of texture details. Extensive experiments conducted on public datasets demonstrate the effectiveness of our method. We believe this paradigm is of high research and application value and could be an important direction in novel view synthesis.
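
The two-stage data flow described in the abstract (pose, then coarse image, then refined image) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the function names `gen_net`, `refine_net` and `synthesize`, the toy 4x4 image size, the random linear weights and the contrast-boosting refinement are hypothetical stand-ins, not the authors' GenNet or RefineNet, which are trained CNNs (the second one adversarially).

```python
import math
import random

H, W = 4, 4  # toy image size, chosen only for illustration


def gen_net(pose, weights):
    """Stage 1 stand-in: map a 6-DoF pose to a coarse grayscale image.

    A real GenNet would be a trained CNN; here each pixel is a sigmoid
    of a linear combination of the six pose values.
    """
    if len(pose) != 6:
        raise ValueError("a 6-DoF pose needs exactly six values")
    coarse = []
    for row in weights:  # one weight row per output pixel
        activation = sum(w * p for w, p in zip(row, pose))
        coarse.append(1.0 / (1.0 + math.exp(-activation)))
    return coarse


def refine_net(coarse, strength=0.5):
    """Stage 2 stand-in: residual refinement of the coarse image.

    A real RefineNet would be a GAN generator; here the residual simply
    pushes each pixel away from the mean and clamps to [0, 1].
    """
    mean = sum(coarse) / len(coarse)
    refined = [c + strength * (c - mean) for c in coarse]
    return [min(1.0, max(0.0, r)) for r in refined]


def synthesize(pose, weights):
    """Run both stages in sequence and return (coarse, refined)."""
    coarse = gen_net(pose, weights)
    return coarse, refine_net(coarse)


if __name__ == "__main__":
    rng = random.Random(0)
    weights = [[rng.uniform(-1, 1) for _ in range(6)] for _ in range(H * W)]
    pose = [0.1, -0.2, 0.3, 0.0, 0.5, -0.1]  # x, y, z, roll, pitch, yaw
    coarse, refined = synthesize(pose, weights)
    print(len(coarse), len(refined))
```

The point of the split is that the first stage only has to learn where things are for a given pose, while the second stage only has to improve an image that is already roughly correct.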

Original language	English
Title of host publication	Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	5028-5035
Number of pages	8
ISBN (Electronic)	9781728188089
DOIs	https://doi.org/10.1109/ICPR48806.2021.9413261
State	Published - 2020
Event	25th International Conference on Pattern Recognition, ICPR 2020 - Virtual, Milan, Italy; Duration: 10 Jan 2021 → 15 Jan 2021

Publication series

Name	Proceedings - International Conference on Pattern Recognition
ISSN (Print)	1051-4651

Conference

Conference	25th International Conference on Pattern Recognition, ICPR 2020
Country/Territory	Italy
City	Virtual, Milan
Period	10/01/21 → 15/01/21

Access to Document

10.1109/ICPR48806.2021.9413261

Cite this

Guo, X., Li, B., Dai, Y., Zhang, T., & Deng, H. (2020). Novel view synthesis from only a 6-DoF camera pose by two-stage networks. In Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition (pp. 5028-5035). Article 9413261 (Proceedings - International Conference on Pattern Recognition). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICPR48806.2021.9413261

@inproceedings{8667d3e179ed48239862fa68f65145c5,

title = "Novel view synthesis from only a 6-DoF camera pose by two-stage networks",

abstract = "Novel view synthesis is a challenging problem in computer vision and robotics. Unlike existing works, which need reference images or 3D models of the scene to generate images under novel views, we propose a new paradigm for this problem: we synthesize the novel view directly from only a 6-DoF camera pose. Although this setting is the most straightforward one, few works address it. Our experiments demonstrate that, with a concise CNN, we can obtain a meaningful parametric model that reconstructs the correct scenery images from the 6-DoF pose alone. To this end, we propose a two-stage learning strategy consisting of two consecutive CNNs: GenNet and RefineNet. GenNet generates a coarse image from a camera pose. RefineNet is a generative adversarial network that refines the coarse image. In this way, we decouple the mapping of geometric relationships from the rendering of texture details. Extensive experiments conducted on public datasets demonstrate the effectiveness of our method. We believe this paradigm is of high research and application value and could be an important direction in novel view synthesis.",

author = "Xiang Guo and Bo Li and Yuchao Dai and Tongxin Zhang and Hui Deng",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE; 25th International Conference on Pattern Recognition, ICPR 2020 ; Conference date: 10-01-2021 Through 15-01-2021",

year = "2020",

doi = "10.1109/ICPR48806.2021.9413261",

language = "English",

series = "Proceedings - International Conference on Pattern Recognition",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "5028--5035",

booktitle = "Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition",

}

Guo, X, Li, B, Dai, Y, Zhang, T & Deng, H 2020, Novel view synthesis from only a 6-DoF camera pose by two-stage networks. in Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition, 9413261, Proceedings - International Conference on Pattern Recognition, Institute of Electrical and Electronics Engineers Inc., pp. 5028-5035, 25th International Conference on Pattern Recognition, ICPR 2020, Virtual, Milan, Italy, 10/01/21. https://doi.org/10.1109/ICPR48806.2021.9413261

Novel view synthesis from only a 6-DoF camera pose by two-stage networks. / Guo, Xiang; Li, Bo; Dai, Yuchao et al.
Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition. Institute of Electrical and Electronics Engineers Inc., 2020. p. 5028-5035 9413261 (Proceedings - International Conference on Pattern Recognition).

TY - GEN

T1 - Novel view synthesis from only a 6-DoF camera pose by two-stage networks

AU - Guo, Xiang

AU - Li, Bo

AU - Dai, Yuchao

AU - Zhang, Tongxin

AU - Deng, Hui

PY - 2020

Y1 - 2020

N2 - Novel view synthesis is a challenging problem in computer vision and robotics. Unlike existing works, which need reference images or 3D models of the scene to generate images under novel views, we propose a new paradigm for this problem: we synthesize the novel view directly from only a 6-DoF camera pose. Although this setting is the most straightforward one, few works address it. Our experiments demonstrate that, with a concise CNN, we can obtain a meaningful parametric model that reconstructs the correct scenery images from the 6-DoF pose alone. To this end, we propose a two-stage learning strategy consisting of two consecutive CNNs: GenNet and RefineNet. GenNet generates a coarse image from a camera pose. RefineNet is a generative adversarial network that refines the coarse image. In this way, we decouple the mapping of geometric relationships from the rendering of texture details. Extensive experiments conducted on public datasets demonstrate the effectiveness of our method. We believe this paradigm is of high research and application value and could be an important direction in novel view synthesis.

UR - http://www.scopus.com/inward/record.url?scp=85110537056&partnerID=8YFLogxK

U2 - 10.1109/ICPR48806.2021.9413261

DO - 10.1109/ICPR48806.2021.9413261

M3 - Conference contribution

AN - SCOPUS:85110537056

T3 - Proceedings - International Conference on Pattern Recognition

SP - 5028

EP - 5035

BT - Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 25th International Conference on Pattern Recognition, ICPR 2020

Y2 - 10 January 2021 through 15 January 2021

ER -

Guo X, Li B, Dai Y, Zhang T, Deng H. Novel view synthesis from only a 6-DoF camera pose by two-stage networks. In Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition. Institute of Electrical and Electronics Engineers Inc. 2020. p. 5028-5035. 9413261. (Proceedings - International Conference on Pattern Recognition). doi: 10.1109/ICPR48806.2021.9413261
