Abstract
Aiming at the problem that the traditional optimization algorithm is difficult to get the desired results in a short time in the research of multi-UAV (unmanned aerial vehicle) task decision-making method, this paper proposes a multi-agent deep deterministic policy gradient (MADDPG) algorithm based on deep reinforcement learning. It allows UAVs to use global information in learning and only local information in application decision-making. The model structure of MADDPG algorithm is designed. Finally, through simulation experiments and comparing with deep deterministic policy gradient (DDPG) algorithm, it is verified that the MADDPG algorithm proposed in this paper can greatly improve the learning speed on the basis of ensuring the accuracy, and make up for the shortcomings of the traditional reinforcement learning algorithm in the field of multiple agents.
Translated title of the contribution | Multi-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient |
---|---|
Original language | Chinese (Traditional) |
Pages (from-to) | 757-765 |
Number of pages | 9 |
Journal | Yuhang Xuebao/Journal of Astronautics |
Volume | 42 |
Issue number | 6 |
DOIs | |
State | Published - 30 Jun 2021 |