MSDC-Net: Multi-scale dense and contextual networks for stereo matching

Zhibo Rao; Mingyi He; Yuchao Dai; Zhidong Zhu; Bo Li; Renjie He

doi:10.1109/APSIPAASC47483.2019.9023237

MSDC-Net: Multi-scale dense and contextual networks for stereo matching

Zhibo Rao, Mingyi He, Yuchao Dai, Zhidong Zhu, Bo Li, Renjie He

School of Electronics and Information

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Scopus citations

Abstract

Disparity prediction from stereo images is essential to computer vision applications such as autonomous driving, 3D model reconstruction, and object detection. To more accurately predict disparity map, a novel deep learning architecture (called MSDC-Net) for detecting the disparity map from a rectified pair of stereo images is proposed. Our MSDC-Net contains two modules: the multi-scale fusion 2D convolution module and the multi-scale residual 3D convolution module. The multi-scale fusion 2D convolution module exploits the potential multi-scale features, which extracts and fuses the different scale features by Dense-Net. The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module. Experimental results on Scene Flow and KITTI datasets demonstrate that our MSDC-Net significantly outperforms other approaches in the non-occluded region.

Original language	English
Title of host publication	2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	578-583
Number of pages	6
ISBN (Electronic)	9781728132488
DOIs	https://doi.org/10.1109/APSIPAASC47483.2019.9023237
State	Published - Nov 2019
Event	2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019 - Lanzhou, China Duration: 18 Nov 2019 → 21 Nov 2019

Publication series

Name	2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019

Conference

Conference	2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
Country/Territory	China
City	Lanzhou
Period	18/11/19 → 21/11/19

Access to Document

10.1109/APSIPAASC47483.2019.9023237

Cite this

Rao, Z., He, M., Dai, Y., Zhu, Z., Li, B., & He, R. (2019). MSDC-Net: Multi-scale dense and contextual networks for stereo matching. In 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019 (pp. 578-583). Article 9023237 (2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/APSIPAASC47483.2019.9023237

Rao, Zhibo ; He, Mingyi ; Dai, Yuchao et al. / MSDC-Net : Multi-scale dense and contextual networks for stereo matching. 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 578-583 (2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019).

@inproceedings{bbfcc1ed4a01420a99f3d3c80dfbb087,

title = "MSDC-Net: Multi-scale dense and contextual networks for stereo matching",

abstract = "Disparity prediction from stereo images is essential to computer vision applications such as autonomous driving, 3D model reconstruction, and object detection. To more accurately predict disparity map, a novel deep learning architecture (called MSDC-Net) for detecting the disparity map from a rectified pair of stereo images is proposed. Our MSDC-Net contains two modules: the multi-scale fusion 2D convolution module and the multi-scale residual 3D convolution module. The multi-scale fusion 2D convolution module exploits the potential multi-scale features, which extracts and fuses the different scale features by Dense-Net. The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module. Experimental results on Scene Flow and KITTI datasets demonstrate that our MSDC-Net significantly outperforms other approaches in the non-occluded region.",

author = "Zhibo Rao and Mingyi He and Yuchao Dai and Zhidong Zhu and Bo Li and Renjie He",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019 ; Conference date: 18-11-2019 Through 21-11-2019",

year = "2019",

month = nov,

doi = "10.1109/APSIPAASC47483.2019.9023237",

language = "英语",

series = "2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "578--583",

booktitle = "2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019",

}

Rao, Z, He, M, Dai, Y, Zhu, Z, Li, B & He, R 2019, MSDC-Net: Multi-scale dense and contextual networks for stereo matching. in 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019., 9023237, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, Institute of Electrical and Electronics Engineers Inc., pp. 578-583, 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, Lanzhou, China, 18/11/19. https://doi.org/10.1109/APSIPAASC47483.2019.9023237

MSDC-Net: Multi-scale dense and contextual networks for stereo matching. / Rao, Zhibo; He, Mingyi; Dai, Yuchao et al.
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 578-583 9023237 (2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - MSDC-Net

T2 - 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019

AU - Rao, Zhibo

AU - He, Mingyi

AU - Dai, Yuchao

AU - Zhu, Zhidong

AU - Li, Bo

AU - He, Renjie

PY - 2019/11

Y1 - 2019/11

N2 - Disparity prediction from stereo images is essential to computer vision applications such as autonomous driving, 3D model reconstruction, and object detection. To more accurately predict disparity map, a novel deep learning architecture (called MSDC-Net) for detecting the disparity map from a rectified pair of stereo images is proposed. Our MSDC-Net contains two modules: the multi-scale fusion 2D convolution module and the multi-scale residual 3D convolution module. The multi-scale fusion 2D convolution module exploits the potential multi-scale features, which extracts and fuses the different scale features by Dense-Net. The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module. Experimental results on Scene Flow and KITTI datasets demonstrate that our MSDC-Net significantly outperforms other approaches in the non-occluded region.

AB - Disparity prediction from stereo images is essential to computer vision applications such as autonomous driving, 3D model reconstruction, and object detection. To more accurately predict disparity map, a novel deep learning architecture (called MSDC-Net) for detecting the disparity map from a rectified pair of stereo images is proposed. Our MSDC-Net contains two modules: the multi-scale fusion 2D convolution module and the multi-scale residual 3D convolution module. The multi-scale fusion 2D convolution module exploits the potential multi-scale features, which extracts and fuses the different scale features by Dense-Net. The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module. Experimental results on Scene Flow and KITTI datasets demonstrate that our MSDC-Net significantly outperforms other approaches in the non-occluded region.

UR - http://www.scopus.com/inward/record.url?scp=85082388357&partnerID=8YFLogxK

U2 - 10.1109/APSIPAASC47483.2019.9023237

DO - 10.1109/APSIPAASC47483.2019.9023237

M3 - 会议稿件

AN - SCOPUS:85082388357

T3 - 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019

SP - 578

EP - 583

BT - 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 18 November 2019 through 21 November 2019

ER -

Rao Z, He M, Dai Y, Zhu Z, Li B, He R. MSDC-Net: Multi-scale dense and contextual networks for stereo matching. In 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 578-583. 9023237. (2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019). doi: 10.1109/APSIPAASC47483.2019.9023237

MSDC-Net: Multi-scale dense and contextual networks for stereo matching

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this