TY - GEN
T1 - An Improved Method towards Multi-UAV Autonomous Navigation Using Deep Reinforcement Learning
AU - Wu, Dingwei
AU - Wan, Kaifang
AU - Tang, Jianqiang
AU - Gao, Xiaoguang
AU - Zhai, Yiwei
AU - Qi, Zhaohui
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Autonomous navigation is a key technology of multi-UAV systems, and deep reinforcement learning can endow UAVs with powerful autonomous decision-making capabilities. To improve the convergence speed and stability of reinforcement learning, this paper proposes a multi-agent deep deterministic policy gradient algorithm based on prioritized experience replay, namely PER-MADDPG. This algorithm makes the samples with higher priority have a higher probability of being chosen for the parameter update, which can speed up the algorithm convergence. Moreover, the actions of UAVs are generated utilizing parameter noise, which can improve the stability and robustness of the algorithm. Experiments show that PER-MADDPG has fast convergence speed and good convergence results, and has excellent autonomous navigation capabilities.
AB - Autonomous navigation is a key technology of multi-UAV systems, and deep reinforcement learning can endow UAVs with powerful autonomous decision-making capabilities. To improve the convergence speed and stability of reinforcement learning, this paper proposes a multi-agent deep deterministic policy gradient algorithm based on prioritized experience replay, namely PER-MADDPG. This algorithm makes the samples with higher priority have a higher probability of being chosen for the parameter update, which can speed up the algorithm convergence. Moreover, the actions of UAVs are generated utilizing parameter noise, which can improve the stability and robustness of the algorithm. Experiments show that PER-MADDPG has fast convergence speed and good convergence results, and has excellent autonomous navigation capabilities.
KW - autonomous navigation
KW - MADDPG
KW - multi-UAV
KW - prioritized experience replay
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85130622450&partnerID=8YFLogxK
U2 - 10.1109/ICCRE55123.2022.9770236
DO - 10.1109/ICCRE55123.2022.9770236
M3 - 会议稿件
AN - SCOPUS:85130622450
T3 - 2022 7th International Conference on Control and Robotics Engineering, ICCRE 2022
SP - 96
EP - 101
BT - 2022 7th International Conference on Control and Robotics Engineering, ICCRE 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Control and Robotics Engineering, ICCRE 2022
Y2 - 15 April 2022 through 17 April 2022
ER -