摘要
Aiming at the problem that the traditional optimization algorithm is difficult to get the desired results in a short time in the research of multi-UAV (unmanned aerial vehicle) task decision-making method, this paper proposes a multi-agent deep deterministic policy gradient (MADDPG) algorithm based on deep reinforcement learning. It allows UAVs to use global information in learning and only local information in application decision-making. The model structure of MADDPG algorithm is designed. Finally, through simulation experiments and comparing with deep deterministic policy gradient (DDPG) algorithm, it is verified that the MADDPG algorithm proposed in this paper can greatly improve the learning speed on the basis of ensuring the accuracy, and make up for the shortcomings of the traditional reinforcement learning algorithm in the field of multiple agents.
投稿的翻译标题 | Multi-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient |
---|---|
源语言 | 繁体中文 |
页(从-至) | 757-765 |
页数 | 9 |
期刊 | Yuhang Xuebao/Journal of Astronautics |
卷 | 42 |
期 | 6 |
DOI | |
出版状态 | 已出版 - 30 6月 2021 |
关键词
- Deep reinforcement learning
- Multi-agent
- Policy gradient
- Task decision-making
- UAV