TY - GEN
T1 - Self-supervised Monocular Depth Estimation Method Based on Piecewise Plane Model
AU - Zhang, Weiwei
AU - Zhang, Guanwen
AU - Zhou, Wei
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Monocular depth estimation is crucial in scene understanding and autonomous driving. Recently, a considerable number of methods based on deep learning and piecewise plane geometric priors have made significant progress. However, these methods still face the following issues: 1) Current plane segmentation methods based on pixel color and object contours often yield sub-planes that are mostly curved surfaces, making direct fitting with a single plane model ineffective. 2) Plane prior knowledge ignores the depth distribution of pixels at sub-plane edges, leading to poor depth estimation at these edges.3) Existing methods often rely on difficult-to-obtain depth ground truth as supervision signals. To address the aforementioned issues, we propose a self-supervised monocular depth estimation method based on monocular video. We introduce a multi-plane fusion approach to fit sub-planes in images. We model the pixel depth at sub-plane edges as bimodal distribution and design a dynamic search method to enhance the computation efficiency of the cost volume. We validate the performance of our proposed method on the KITTI and NYU-Depth-v2 datasets.
AB - Monocular depth estimation is crucial in scene understanding and autonomous driving. Recently, a considerable number of methods based on deep learning and piecewise plane geometric priors have made significant progress. However, these methods still face the following issues: 1) Current plane segmentation methods based on pixel color and object contours often yield sub-planes that are mostly curved surfaces, making direct fitting with a single plane model ineffective. 2) Plane prior knowledge ignores the depth distribution of pixels at sub-plane edges, leading to poor depth estimation at these edges.3) Existing methods often rely on difficult-to-obtain depth ground truth as supervision signals. To address the aforementioned issues, we propose a self-supervised monocular depth estimation method based on monocular video. We introduce a multi-plane fusion approach to fit sub-planes in images. We model the pixel depth at sub-plane edges as bimodal distribution and design a dynamic search method to enhance the computation efficiency of the cost volume. We validate the performance of our proposed method on the KITTI and NYU-Depth-v2 datasets.
KW - Deep learning
KW - Planarity prior
KW - Self-supervised monocular depth estimation
UR - http://www.scopus.com/inward/record.url?scp=85205705002&partnerID=8YFLogxK
U2 - 10.1109/ICIEA61579.2024.10664764
DO - 10.1109/ICIEA61579.2024.10664764
M3 - 会议稿件
AN - SCOPUS:85205705002
T3 - 2024 IEEE 19th Conference on Industrial Electronics and Applications, ICIEA 2024
BT - 2024 IEEE 19th Conference on Industrial Electronics and Applications, ICIEA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 19th IEEE Conference on Industrial Electronics and Applications, ICIEA 2024
Y2 - 5 August 2024 through 8 August 2024
ER -