An Improved Method towards Multi-UAV Autonomous Navigation Using Deep Reinforcement Learning

Dingwei Wu, Kaifang Wan, Jianqiang Tang, Xiaoguang Gao, Yiwei Zhai, Zhaohui Qi

科研成果: 书/报告/会议事项章节会议稿件同行评审

11 引用 (Scopus)

摘要

Autonomous navigation is a key technology of multi-UAV systems, and deep reinforcement learning can endow UAVs with powerful autonomous decision-making capabilities. To improve the convergence speed and stability of reinforcement learning, this paper proposes a multi-agent deep deterministic policy gradient algorithm based on prioritized experience replay, namely PER-MADDPG. This algorithm makes the samples with higher priority have a higher probability of being chosen for the parameter update, which can speed up the algorithm convergence. Moreover, the actions of UAVs are generated utilizing parameter noise, which can improve the stability and robustness of the algorithm. Experiments show that PER-MADDPG has fast convergence speed and good convergence results, and has excellent autonomous navigation capabilities.

源语言英语
主期刊名2022 7th International Conference on Control and Robotics Engineering, ICCRE 2022
出版商Institute of Electrical and Electronics Engineers Inc.
96-101
页数6
ISBN(电子版)9781665468404
DOI
出版状态已出版 - 2022
活动7th International Conference on Control and Robotics Engineering, ICCRE 2022 - Beijing, 中国
期限: 15 4月 202217 4月 2022

出版系列

姓名2022 7th International Conference on Control and Robotics Engineering, ICCRE 2022

会议

会议7th International Conference on Control and Robotics Engineering, ICCRE 2022
国家/地区中国
Beijing
时期15/04/2217/04/22

指纹

探究 'An Improved Method towards Multi-UAV Autonomous Navigation Using Deep Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此