TY - GEN
T1 - Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images
AU - Yang, Delong
AU - Luo, Zhaohui
AU - Shang, Peng
AU - Hu, Zhigang
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/8/9
Y1 - 2021/8/9
N2 - Unsupervised deep learning methods have demonstrated an impressive performance for understanding the structure of 3D scene from videos. These data-based learning methods are able to learn the tasks, such as depth, ego-motion, and optical flow estimation. In this paper, we propose a novel unsupervised deep learning method to jointly estimate scene depth, camera ego-motion, and optical flow from stereo images. Consecutive stereo images are used to train the system. After training stage, the system is able to estimate dense depth map, camera 6D pose, and optical flow by using a sequence of monocular images. No labelled data set is required for training. The supervision signals for training three deep neural networks of the system come from various forms of image warping. Due to the use of optical flow, the impact caused by occlusions and moving objects on the estimation results is alleviated. Experiments on the KITTI and Cityscapes datasets show that the proposed system demonstrates a better performance in terms of accuracy in depth, ego-motion, and optical flow estimation.
AB - Unsupervised deep learning methods have demonstrated an impressive performance for understanding the structure of 3D scene from videos. These data-based learning methods are able to learn the tasks, such as depth, ego-motion, and optical flow estimation. In this paper, we propose a novel unsupervised deep learning method to jointly estimate scene depth, camera ego-motion, and optical flow from stereo images. Consecutive stereo images are used to train the system. After training stage, the system is able to estimate dense depth map, camera 6D pose, and optical flow by using a sequence of monocular images. No labelled data set is required for training. The supervision signals for training three deep neural networks of the system come from various forms of image warping. Due to the use of optical flow, the impact caused by occlusions and moving objects on the estimation results is alleviated. Experiments on the KITTI and Cityscapes datasets show that the proposed system demonstrates a better performance in terms of accuracy in depth, ego-motion, and optical flow estimation.
KW - deep learning
KW - depth estimation
KW - ego-motion
KW - otpical flow
UR - http://www.scopus.com/inward/record.url?scp=85115445359&partnerID=8YFLogxK
U2 - 10.1109/ICTLE53360.2021.9525746
DO - 10.1109/ICTLE53360.2021.9525746
M3 - 会议稿件
AN - SCOPUS:85115445359
T3 - 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021
SP - 51
EP - 56
BT - 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021
Y2 - 9 August 2021 through 11 August 2021
ER -