Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection

Xincheng Pang, Wenke Xia, Zhigang Wang, Bin Zhao, Di Hu, Dong Wang, Xuelong Li

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

3D perception is crucial for generalizable robotic manipulation. While recent foundation models have made significant strides in perception and decision-making with RGB-based input, their lack of 3D perception limits their effectiveness in fine-grained robotic manipulation tasks. To address this limitation, we propose a Depth Information Injection (DI2) framework that leverages the RGB-Depth modality for policy fine-tuning, while relying solely on RGB images for robust and efficient deployment. Concretely, we introduce the Depth Completion Module (DCM) to extract spatial prior knowledge related to depth information and to generate virtual depth information from RGB inputs to aid policy deployment. Further, we propose the Depth-Aware Codebook (DAC) to eliminate noise and reduce the cumulative error from depth prediction. In the inference phase, the framework uses RGB inputs and accurately predicted depth data to generate the manipulation action. We conduct experiments in simulated LIBERO environments and real-world scenarios, and the experimental results show that our method effectively enhances the pre-trained RGB-based policy with 3D perception ability for robotic manipulation. The website is released at https://gewu-lab.github.io/DepthHelps-IROS2024.
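The abstract does not say how the Depth-Aware Codebook is implemented. As a rough illustration of the general idea of a codebook, the sketch below snaps a noisy predicted feature to its nearest entry in a fixed set of vectors (nearest-neighbor vector quantization). This is an assumption about one common way to build such a component, not the authors' method; the names `quantize` and `squared_distance`, the 2-D feature size, and all values are hypothetical.

```python
from typing import List, Sequence


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def quantize(feature: Sequence[float], codebook: List[Sequence[float]]) -> int:
    """Return the index of the codebook entry nearest to `feature`."""
    return min(
        range(len(codebook)),
        key=lambda i: squared_distance(feature, codebook[i]),
    )


# Hypothetical 2-D codebook with three entries (illustrative values only).
codebook = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0)]

# A noisy predicted depth feature; a hypothetical stand-in for a DCM output.
predicted = (0.9, 1.2)

index = quantize(predicted, codebook)
snapped = codebook[index]  # the discrete entry replaces the noisy prediction
print(index, snapped)
```

Replacing a continuous prediction with a discrete entry is one way small prediction errors can be kept from accumulating downstream, which matches the stated goal of the DAC at a high level only.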

Original language: English
Title of host publication: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 7251-7256
Number of pages: 6
ISBN (electronic): 9798350377705
DOI
Publication status: Published - 2024
Event: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024 - Abu Dhabi, United Arab Emirates
Duration: 14 Oct 2024 to 18 Oct 2024

Publication series

Name: IEEE International Conference on Intelligent Robots and Systems
ISSN (print): 2153-0858
ISSN (electronic): 2153-0866

Conference

Conference: 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2024
Country/Territory: United Arab Emirates
City: Abu Dhabi
Period: 14/10/24 to 18/10/24
