A method to build multi-scene datasets for CNN for camera pose regression

Yuhao Ma, Hao Guo, Hong Chen, Mengxiao Tian, Xin Huo, Chengjiang Long, Shiye Tang, Xiaoyu Song, Qing Wang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Convolutional neural networks (CNNs) have been shown to be useful for camera pose regression, and they are robust to challenging scenarios such as lighting changes, motion blur, and scenes with many textureless surfaces. Additionally, PoseNet shows that a deep learning system can interpolate the camera pose in the space between training images. In this paper, we explore how different strategies for processing datasets affect pose regression and propose a method for building multi-scene datasets for training such neural networks. We demonstrate that the locations of several scenes can be remembered using only one neural network. By combining multiple scenes, we found that the position errors of the neural network do not decrease significantly as the distance between the cameras increases, which means that we do not need to train several models as the number of scenes increases. We also explore the factors that influence the accuracy of models for multi-scene camera pose regression, which can help us merge several scenes into one dataset more effectively. We have released our code and datasets publicly to support further research.

Original language: English
Title of host publication: Proceedings - 2018 IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 108-115
Number of pages: 8
ISBN (Electronic): 9781538692691
DOIs
State: Published - 2 Jul 2018
Externally published: Yes
Event: 1st IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018 - Taichung, Taiwan, Province of China
Duration: 10 Dec 2018 → 12 Dec 2018

Publication series

Name: Proceedings - 2018 IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018

Conference

Conference: 1st IEEE International Conference on Artificial Intelligence and Virtual Reality, AIVR 2018
Country/Territory: Taiwan, Province of China
City: Taichung
Period: 10/12/18 → 12/12/18

Keywords

  • Camera pose estimation
  • Convolutional neural network
  • Dataset
  • Visual localization
