TY - GEN
T1 - Research on Maneuver Decision Algorithm of Multi-UAV Based on Action Intention Reinforcement Learning
AU - Huo, Weiyu
AU - Lian, Zhenjiang
AU - Liu, Yang
AU - Fang, Yeehom
AU - Zhou, Deyun
N1 - Publisher Copyright:
© 2025 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2025
Y1 - 2025
N2 - In multi-UAV air combat, traditional reinforcement learning (RL) struggles with high computational demands and policy learning inefficiencies arising from complex mission tasks and intensive inter-agent interactions. To address these limitations, this study proposes an intention-based RL framework that leverages prior knowledge to enhance learning efficiency and improve state-action mapping. Within this framework, a multi-UAV maneuver decision-making algorithm is developed, defining four action intentions - attack, surveillance, support, and evasion - to represent tactical objectives, each mapped to feasible maneuver actions based on situational data. The algorithm integrates a situational assessment model, cooperative target allocation, and a deep Q-network (DQN) to guide decision-making. Simulation results confirm the approach's effectiveness in enhancing multi-UAV maneuver coordination and decision performance.
AB - In multi-UAV air combat, traditional reinforcement learning (RL) struggles with high computational demands and policy learning inefficiencies arising from complex mission tasks and intensive inter-agent interactions. To address these limitations, this study proposes an intention-based RL framework that leverages prior knowledge to enhance learning efficiency and improve state-action mapping. Within this framework, a multi-UAV maneuver decision-making algorithm is developed, defining four action intentions - attack, surveillance, support, and evasion - to represent tactical objectives, each mapped to feasible maneuver actions based on situational data. The algorithm integrates a situational assessment model, cooperative target allocation, and a deep Q-network (DQN) to guide decision-making. Simulation results confirm the approach's effectiveness in enhancing multi-UAV maneuver coordination and decision performance.
KW - maneuvering decision
KW - multi-UAV
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/105020286231
U2 - 10.23919/CCC64809.2025.11178768
DO - 10.23919/CCC64809.2025.11178768
M3 - 会议稿件
AN - SCOPUS:105020286231
T3 - Chinese Control Conference, CCC
SP - 2784
EP - 2789
BT - Proceedings of the 44th Chinese Control Conference, CCC 2025
A2 - Sun, Jian
A2 - Yin, Hongpeng
PB - IEEE Computer Society
T2 - 44th Chinese Control Conference, CCC 2025
Y2 - 28 July 2025 through 30 July 2025
ER -