Deep Reinforcement Learning-based End-to-End Navigation of Mobile Robots With Reward Shaping

Yufeng Li, Jian Gao, Yimin Chen, Yaozhen He, Boxu Min

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper proposes an end-to-end autonomous navigation algorithm for unknown environments based on deep reinforcement learning (DRL), which maps the lidar data collected by the robot into control commands. The proposed LM-TD3 algorithm utilizes the Twin Delayed Deep Deterministic(TD3) policy gradient network as the backbone to generate robot action control in continuous spaces. Based on this, the Long Short-Term Memory (LSTM) neural network is introduced into the actor and critic networks, allowing the model to store long-term navigation experiences to increase its ability to perceive and handle surrounding obstacles. Furthermore, a novel reward function in DRL is designed to smooth the motion pose of the robot while controlling the robot to achieve target tracking. Finally, to enhance the early learning efficiency of the DRL network, a Hindsight Experience Replay (HER) strategy is designed specifically for the autonomous navigation system to enhance the convergence speed of the algorithm. To validate the effectiveness of the LM-TD3 algorithm with simulation experiments, scenarios of varying complexities are designed to verify the navigation ability. Compared with the TD3 algorithm, the proposed LMTD3 method can generate shorter paths with enhanced obstacle avoidance capabilities, while also maintaining more stable robot posture control.

源语言英语
主期刊名Proceedings - 2024 IEEE 22nd International Conference on Industrial Informatics, INDIN 2024
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331527471
DOI
出版状态已出版 - 2024
活动22nd IEEE International Conference on Industrial Informatics, INDIN 2024 - Beijing, 中国
期限: 18 8月 202420 8月 2024

出版系列

姓名IEEE International Conference on Industrial Informatics (INDIN)
ISSN(印刷版)1935-4576

会议

会议22nd IEEE International Conference on Industrial Informatics, INDIN 2024
国家/地区中国
Beijing
时期18/08/2420/08/24

指纹

探究 'Deep Reinforcement Learning-based End-to-End Navigation of Mobile Robots With Reward Shaping' 的科研主题。它们共同构成独一无二的指纹。

引用此