TY - JOUR
T1 - Solving Monocular Sensors Depth Prediction Using MLP-Based Architecture and Multi-Scale Inverse Attention
AU - Cheng, Zeyu
AU - Zhang, Yi
AU - Tang, Chengkai
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2022/8/15
Y1 - 2022/8/15
N2 - Monocular sensors depth prediction has received continuous attention in recent years because of its wide application in autonomous driving, intelligent system navigation and other fields. Convolutional neural networks have dominated monocular depth prediction for a long time, and the recent introduction of Transformer-based and MLP-based architectures in the field of computer vision has provided some new ideas for monocular depth prediction. However, they all have a series of problems such as high computational complexity and excessive parameters. In this paper, we propose MLP-Depth, which is a lightweight monocular depth prediction method based on hierarchical multi-stage MLP, and utilizes depth-wise convolution to improve local modeling capabilities and reduce parameters and computational costs. In addition, we also design a multi-scale inverse attention mechanism to implicitly improve the global expressiveness of MLP-Depth. Our method effectively reduces the number of parameters of monocular depth prediction network using transformer-like architectures, and extensive experiments show that MLP-Depth can achieve competitive results with fewer parameters in challenging outdoor and indoor datasets.
AB - Monocular sensors depth prediction has received continuous attention in recent years because of its wide application in autonomous driving, intelligent system navigation and other fields. Convolutional neural networks have dominated monocular depth prediction for a long time, and the recent introduction of Transformer-based and MLP-based architectures in the field of computer vision has provided some new ideas for monocular depth prediction. However, they all have a series of problems such as high computational complexity and excessive parameters. In this paper, we propose MLP-Depth, which is a lightweight monocular depth prediction method based on hierarchical multi-stage MLP, and utilizes depth-wise convolution to improve local modeling capabilities and reduce parameters and computational costs. In addition, we also design a multi-scale inverse attention mechanism to implicitly improve the global expressiveness of MLP-Depth. Our method effectively reduces the number of parameters of monocular depth prediction network using transformer-like architectures, and extensive experiments show that MLP-Depth can achieve competitive results with fewer parameters in challenging outdoor and indoor datasets.
KW - Hierarchical multi-stage MLP
KW - Monocular sensors depth prediction
KW - Multi-scale inverse attention
UR - http://www.scopus.com/inward/record.url?scp=85134268462&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2022.3187152
DO - 10.1109/JSEN.2022.3187152
M3 - 文章
AN - SCOPUS:85134268462
SN - 1530-437X
VL - 22
SP - 16178
EP - 16189
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 16
ER -