TY - GEN
T1 - Multi-UAV Cooperative Obstacle Avoidance Path Planning Based on PER-MASAC
AU - Bai, Yifan
AU - Zhang, Hongda
AU - Feng, Xiaoyi
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - To address the challenges of limited local perception and high decision-making coupling in multi-UAV cooperative obstacle avoidance under dynamic and complex battlefield environments, this paper proposes a Multi-Agent Soft Actor-Critic algorithm integrated with Prioritized Experience Replay (PER-MASAC). The method constructs a multidimensional state space incorporating target navigation, obstacle perception, and neighbor coordination. A hybrid reward function combining sparse and shaped rewards is designed to enhance learning efficiency. Furthermore, a SumTree-based priority sampling mechanism is introduced to improve the reuse of high-value experiences. Experimental results in a high-fidelity simulation environment demonstrate that compared to MASAC and other mainstream algorithms, PER-MASAC achieves significantly better performance in terms of convergence speed and task success rate. This provides a verifiable solution for intelligent decision-making of UAV swarms in unknown and complex environments.
AB - To address the challenges of limited local perception and high decision-making coupling in multi-UAV cooperative obstacle avoidance under dynamic and complex battlefield environments, this paper proposes a Multi-Agent Soft Actor-Critic algorithm integrated with Prioritized Experience Replay (PER-MASAC). The method constructs a multidimensional state space incorporating target navigation, obstacle perception, and neighbor coordination. A hybrid reward function combining sparse and shaped rewards is designed to enhance learning efficiency. Furthermore, a SumTree-based priority sampling mechanism is introduced to improve the reuse of high-value experiences. Experimental results in a high-fidelity simulation environment demonstrate that compared to MASAC and other mainstream algorithms, PER-MASAC achieves significantly better performance in terms of convergence speed and task success rate. This provides a verifiable solution for intelligent decision-making of UAV swarms in unknown and complex environments.
KW - Multi-UAV Cooperation
KW - Obstacle Avoidance
KW - Prioritized Experience Replay (PER)
KW - Soft Actor-Critic (SAC)
UR - https://www.scopus.com/pages/publications/105018452090
U2 - 10.1109/ICIPMC66319.2025.11170497
DO - 10.1109/ICIPMC66319.2025.11170497
M3 - 会议稿件
AN - SCOPUS:105018452090
T3 - 2025 4th International Conference on Image Processing and Media Computing, ICIPMC 2025
SP - 168
EP - 172
BT - 2025 4th International Conference on Image Processing and Media Computing, ICIPMC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Image Processing and Media Computing, ICIPMC 2025
Y2 - 27 June 2025 through 29 June 2025
ER -