A method to build multi-scene datasets for CNN for camera pose regression

Yuhao Ma, Hao Guo, Hong Chen, Mengxiao Tian, Xin Huo, Chengjiang Long, Shiye Tang, Xiaoyu Song, Qing Wang

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

2 Citations (Scopus)

Abstract

Convolutional neural networks (CNNs) have been shown to be useful for camera pose regression, and they are robust in challenging scenarios such as lighting changes, motion blur, and scenes with many textureless surfaces. Additionally, PoseNet shows that a deep learning system can interpolate the camera pose in the space between training images. In this paper, we explore how different dataset-processing strategies affect pose regression and propose a method for building multi-scene datasets for training such neural networks. We demonstrate that the locations of several scenes can be remembered using only one neural network. By combining multiple scenes, we found that the position errors of the neural network do not decrease significantly as the distance between the cameras increases, which means that we do not need to train several models as the number of scenes increases. We also explore the factors that influence the accuracy of models for multi-scene camera pose regression, which can help us merge several scenes into one dataset more effectively. We have released our code and datasets to the public to support further research.

Original language: English
Title of host publication: Proceedings - 2018 IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 108-115
Number of pages: 8
ISBN (Electronic): 9781538692691
DOI
Publication status: Published - 2 Jul 2018
Externally published
Event: 1st IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018 - Taichung, Taiwan, China
Duration: 10 Dec 2018 to 12 Dec 2018

Publication series

Name: Proceedings - 2018 IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018

Conference

Conference: 1st IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018
Country/Territory: Taiwan, China
City: Taichung
Period: 10/12/18 to 12/12/18
