TY - JOUR
T1 - 基于 MASAC 强化学习算法的多无人机协同路径规划
AU - Fang, Chengliang
AU - Yang, Feisheng
AU - Pan, Quan
N1 - Publisher Copyright:
© 2024 Science China Press. All rights reserved.
PY - 2024
Y1 - 2024
N2 - This paper proposes a novel multi-agent deep reinforcement learning algorithm for the collaborative path planning problem of heterogeneous unmanned aerial vehicles (UAVs) in a dynamic uncertain environment. Firstly, a reinforcement learning environment for UAVs is developed to reach a target location in an airspace scenario, where the environment introduces the UAV dynamics equations and considers the UAV heterogeneity as well as the requirement for safe obstacle avoidance. Secondly, evaluation metrics including task completion rate, formation maintenance rate, flight time, flight trajectory, and energy consumption are designed to evaluate the algorithm performance. Then, the multi-UAV collaborative path planning problem is modeled as a partially observable Markov decision process and a multi-agent soft actor critic algorithm is proposed to seek the approximate optimal strategy for the problem. Finally, the effectiveness and superiority of the proposed algorithm are demonstrated through simulations.
AB - This paper proposes a novel multi-agent deep reinforcement learning algorithm for the collaborative path planning problem of heterogeneous unmanned aerial vehicles (UAVs) in a dynamic uncertain environment. Firstly, a reinforcement learning environment for UAVs is developed to reach a target location in an airspace scenario, where the environment introduces the UAV dynamics equations and considers the UAV heterogeneity as well as the requirement for safe obstacle avoidance. Secondly, evaluation metrics including task completion rate, formation maintenance rate, flight time, flight trajectory, and energy consumption are designed to evaluate the algorithm performance. Then, the multi-UAV collaborative path planning problem is modeled as a partially observable Markov decision process and a multi-agent soft actor critic algorithm is proposed to seek the approximate optimal strategy for the problem. Finally, the effectiveness and superiority of the proposed algorithm are demonstrated through simulations.
KW - multi-agent deep reinforcement learning
KW - multi-agent soft actor critic algorithm
KW - multi-UAV
KW - partially observable Markov decision process
KW - path planning
UR - http://www.scopus.com/inward/record.url?scp=85202673073&partnerID=8YFLogxK
U2 - 10.1360/SSI-2024-0050
DO - 10.1360/SSI-2024-0050
M3 - 文章
AN - SCOPUS:85202673073
SN - 1674-7267
VL - 54
SP - 1871
EP - 1883
JO - Scientia Sinica Informationis
JF - Scientia Sinica Informationis
IS - 8
ER -