Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images

Delong Yang; Zhaohui Luo; Peng Shang; Zhigang Hu

doi:10.1109/ICTLE53360.2021.9525746

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

3 Citations (Scopus)

Abstract

Unsupervised deep learning methods have demonstrated impressive performance in understanding 3D scene structure from videos. These data-driven methods can learn tasks such as depth, ego-motion, and optical flow estimation. In this paper, we propose a novel unsupervised deep learning method to jointly estimate scene depth, camera ego-motion, and optical flow from stereo images. Consecutive stereo images are used to train the system. After training, the system can estimate a dense depth map, the 6D camera pose, and optical flow from a sequence of monocular images. No labelled dataset is required for training. The supervision signals for training the system's three deep neural networks come from various forms of image warping. Because optical flow is used, the impact of occlusions and moving objects on the estimation results is alleviated. Experiments on the KITTI and Cityscapes datasets show that the proposed system achieves better accuracy in depth, ego-motion, and optical flow estimation.
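The abstract states only that supervision comes from "various forms of image warping" and does not specify the loss terms. As an illustration of one common form of such a signal, and not necessarily the authors' formulation, the following NumPy sketch reconstructs the left view of a rectified stereo pair by sampling the right view at x - d(x), then measures the mean absolute photometric error. The function names, the grayscale 2D input, the border clamping, and the L1 error are all assumptions made for this example.

```python
import numpy as np


def warp_right_to_left(right, disparity):
    """Reconstruct the left view by sampling the right image at x - d(x).

    Assumes a rectified stereo pair, a grayscale image of shape (H, W),
    and a per-pixel horizontal disparity of the same shape (hypothetical setup).
    """
    h, w = right.shape
    # Source x-coordinates in the right image, clamped to the image border.
    xs = np.arange(w, dtype=np.float64)[None, :] - disparity
    xs = np.clip(xs, 0.0, w - 1.0)
    # Linear interpolation between the two nearest columns.
    x0 = np.floor(xs).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    frac = xs - x0
    rows = np.arange(h)[:, None]
    return (1.0 - frac) * right[rows, x0] + frac * right[rows, x1]


def photometric_l1(left, right, disparity):
    """Mean absolute difference between the left image and its reconstruction."""
    return float(np.mean(np.abs(left - warp_right_to_left(right, disparity))))
```

In a training loop, an error of this kind would be computed on predicted disparities with a differentiable sampler; the error is small when the predicted disparity explains the left image from the right one, which is why it can stand in for ground-truth labels.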

Original language	English
Title of host publication	2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	51-56
Number of pages	6
ISBN (Electronic)	9781665427524
DOI	https://doi.org/10.1109/ICTLE53360.2021.9525746
Publication status	Published - 9 Aug 2021
Externally published	Yes
Event	9th International Conference on Traffic and Logistic Engineering, ICTLE 2021 - Virtual, Macau, China. Duration: 9 Aug 2021 → 11 Aug 2021

Publication series

Name	2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021

Conference

Conference	9th International Conference on Traffic and Logistic Engineering, ICTLE 2021
Country/Territory	China
City	Virtual, Macau
Period	9/08/21 → 11/08/21

Access to document

10.1109/ICTLE53360.2021.9525746

Other files and links

Link to publication in Scopus

Cite this

Yang, D., Luo, Z., Shang, P., & Hu, Z. (2021). Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images. In 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021 (pp. 51-56). (2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICTLE53360.2021.9525746

Yang, Delong ; Luo, Zhaohui ; Shang, Peng et al. / Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images. 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 51-56 (2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021).

@inproceedings{73dd122b666741f9b922fd3fba28a8d6,

title = "Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images",

abstract = "Unsupervised deep learning methods have demonstrated an impressive performance for understanding the structure of 3D scene from videos. These data-based learning methods are able to learn the tasks, such as depth, ego-motion, and optical flow estimation. In this paper, we propose a novel unsupervised deep learning method to jointly estimate scene depth, camera ego-motion, and optical flow from stereo images. Consecutive stereo images are used to train the system. After training stage, the system is able to estimate dense depth map, camera 6D pose, and optical flow by using a sequence of monocular images. No labelled data set is required for training. The supervision signals for training three deep neural networks of the system come from various forms of image warping. Due to the use of optical flow, the impact caused by occlusions and moving objects on the estimation results is alleviated. Experiments on the KITTI and Cityscapes datasets show that the proposed system demonstrates a better performance in terms of accuracy in depth, ego-motion, and optical flow estimation.",

keywords = "deep learning, depth estimation, ego-motion, optical flow",

author = "Delong Yang and Zhaohui Luo and Peng Shang and Zhigang Hu",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021 ; Conference date: 09-08-2021 Through 11-08-2021",

year = "2021",

month = aug,

day = "9",

doi = "10.1109/ICTLE53360.2021.9525746",

language = "English",

series = "2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "51--56",

booktitle = "2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021",

}

Yang, D, Luo, Z, Shang, P & Hu, Z 2021, Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images. in 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021. 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021, Institute of Electrical and Electronics Engineers Inc., pp. 51-56, 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021, Virtual, Macau, China, 9/08/21. https://doi.org/10.1109/ICTLE53360.2021.9525746

Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images. / Yang, Delong; Luo, Zhaohui; Shang, Peng et al.
2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 51-56 (2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021).

TY - GEN

T1 - Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images

AU - Yang, Delong

AU - Luo, Zhaohui

AU - Shang, Peng

AU - Hu, Zhigang

PY - 2021/8/9

Y1 - 2021/8/9

N2 - Unsupervised deep learning methods have demonstrated an impressive performance for understanding the structure of 3D scene from videos. These data-based learning methods are able to learn the tasks, such as depth, ego-motion, and optical flow estimation. In this paper, we propose a novel unsupervised deep learning method to jointly estimate scene depth, camera ego-motion, and optical flow from stereo images. Consecutive stereo images are used to train the system. After training stage, the system is able to estimate dense depth map, camera 6D pose, and optical flow by using a sequence of monocular images. No labelled data set is required for training. The supervision signals for training three deep neural networks of the system come from various forms of image warping. Due to the use of optical flow, the impact caused by occlusions and moving objects on the estimation results is alleviated. Experiments on the KITTI and Cityscapes datasets show that the proposed system demonstrates a better performance in terms of accuracy in depth, ego-motion, and optical flow estimation.

KW - deep learning

KW - depth estimation

KW - ego-motion

KW - optical flow

UR - http://www.scopus.com/inward/record.url?scp=85115445359&partnerID=8YFLogxK

U2 - 10.1109/ICTLE53360.2021.9525746

DO - 10.1109/ICTLE53360.2021.9525746

M3 - Conference contribution

AN - SCOPUS:85115445359

T3 - 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021

SP - 51

EP - 56

BT - 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021

Y2 - 9 August 2021 through 11 August 2021

ER -

Yang D, Luo Z, Shang P, Hu Z. Unsupervised Deep Learning of Depth, Ego-Motion, and Optical Flow from Stereo Images. In 2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021. Institute of Electrical and Electronics Engineers Inc. 2021. p. 51-56. (2021 9th International Conference on Traffic and Logistic Engineering, ICTLE 2021). doi: 10.1109/ICTLE53360.2021.9525746
