基于MADDPG的多无人机协同任务决策

Translated title of the contribution: Multi-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient

Bo Li, Kai Qiang Yue, Zhi Gang Gan, Pei Xin Gao

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

Aiming at the problem that the traditional optimization algorithm is difficult to get the desired results in a short time in the research of multi-UAV (unmanned aerial vehicle) task decision-making method, this paper proposes a multi-agent deep deterministic policy gradient (MADDPG) algorithm based on deep reinforcement learning. It allows UAVs to use global information in learning and only local information in application decision-making. The model structure of MADDPG algorithm is designed. Finally, through simulation experiments and comparing with deep deterministic policy gradient (DDPG) algorithm, it is verified that the MADDPG algorithm proposed in this paper can greatly improve the learning speed on the basis of ensuring the accuracy, and make up for the shortcomings of the traditional reinforcement learning algorithm in the field of multiple agents.

Translated title of the contributionMulti-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient
Original languageChinese (Traditional)
Pages (from-to)757-765
Number of pages9
JournalYuhang Xuebao/Journal of Astronautics
Volume42
Issue number6
DOIs
StatePublished - 30 Jun 2021

Fingerprint

Dive into the research topics of 'Multi-UAV Cooperative Autonomous Navigation Based on Multi-agent Deep Deterministic Policy Gradient'. Together they form a unique fingerprint.

Cite this