TY - GEN
T1 - MFCS-Depth
T2 - 14th IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2024
AU - Cheng, Zeyu
AU - Zhang, Yi
AU - Zhu, Xingxing
AU - Yu, Yang
AU - Song, Zhe
AU - Tang, Chengkai
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Self-supervised monocular depth estimation plays an extremely important role in fields such as autonomous driving and intelligent robot navigation. However, general monocular depth estimation models require massive computing resources, which seriously hinders their deployment on mobile devices, a capability urgently needed in fields such as autonomous driving. To address this problem, we propose MFCS-Depth, an economical monocular depth estimation method based on multi-scale fusion and a channel separation attention mechanism. We use a Transformer architecture with linear self-attention as the encoder to ensure both global modeling capability and economy. We also design a high-performance, low-cost decoder that improves the local and global reasoning of the network through multi-scale attention fusion and uses scale-wise channel separation to significantly reduce parameters and computing costs. Extensive experiments show that MFCS-Depth achieves competitive results with very few parameters on the KITTI and DDAD datasets and achieves state-of-the-art performance among methods of similar size.
AB - Self-supervised monocular depth estimation plays an extremely important role in fields such as autonomous driving and intelligent robot navigation. However, general monocular depth estimation models require massive computing resources, which seriously hinders their deployment on mobile devices, a capability urgently needed in fields such as autonomous driving. To address this problem, we propose MFCS-Depth, an economical monocular depth estimation method based on multi-scale fusion and a channel separation attention mechanism. We use a Transformer architecture with linear self-attention as the encoder to ensure both global modeling capability and economy. We also design a high-performance, low-cost decoder that improves the local and global reasoning of the network through multi-scale attention fusion and uses scale-wise channel separation to significantly reduce parameters and computing costs. Extensive experiments show that MFCS-Depth achieves competitive results with very few parameters on the KITTI and DDAD datasets and achieves state-of-the-art performance among methods of similar size.
KW - economical self-supervised monocular depth estimation
KW - multi-scale attention fusion
KW - scale-wise channel separation
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85214896925&partnerID=8YFLogxK
U2 - 10.1109/ICSPCC62635.2024.10770511
DO - 10.1109/ICSPCC62635.2024.10770511
M3 - Conference contribution
AN - SCOPUS:85214896925
T3 - 2024 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2024
BT - 2024 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 19 August 2024 through 22 August 2024
ER -