TY - JOUR
T1 - Hierarchical decision-making for UAVs’ game via LLM enhanced multi-agent reinforcement learning
AU - Dong, Xinyu
AU - Li, Bo
AU - Zhang, Guangyu
AU - Xiao, Bing
AU - Wang, Yuanshun
N1 - Publisher Copyright:
© 2026 Elsevier Masson SAS.
PY - 2026/10
Y1 - 2026/10
N2 - For the multi-UAV game missions, intelligent agents face challenges in high-dimensional state processing, dynamic situational awareness, and cooperative decision-making that conventional methods cannot address effectively. This paper proposes a novel Large Language Model-enhanced Multi-agent Deep Reinforcement Learning scheme (LLM-MATD3) with a hierarchical decision-making architecture comprising an environment modeling layer, LLM task planner, and task-enhanced MATD3 network. The scheme first employs a hierarchical task decomposition mechanism that transforms high-level game directives from LLMs into continuous embedding vectors through task embedding techniques. Then, the LLM task planner analyzes global game situations and assigns differentiated tasks to UAVs, overcoming the limitation of traditional MATD3 frameworks that rely solely on local observations. Simulation results show significant improvements in win rate, remaining rate, and collaborative efficiency for UAVs, providing a innovative reference approach for multi-agent systems in complex game environments.
AB - For the multi-UAV game missions, intelligent agents face challenges in high-dimensional state processing, dynamic situational awareness, and cooperative decision-making that conventional methods cannot address effectively. This paper proposes a novel Large Language Model-enhanced Multi-agent Deep Reinforcement Learning scheme (LLM-MATD3) with a hierarchical decision-making architecture comprising an environment modeling layer, LLM task planner, and task-enhanced MATD3 network. The scheme first employs a hierarchical task decomposition mechanism that transforms high-level game directives from LLMs into continuous embedding vectors through task embedding techniques. Then, the LLM task planner analyzes global game situations and assigns differentiated tasks to UAVs, overcoming the limitation of traditional MATD3 frameworks that rely solely on local observations. Simulation results show significant improvements in win rate, remaining rate, and collaborative efficiency for UAVs, providing a innovative reference approach for multi-agent systems in complex game environments.
KW - Hierarchical decision making
KW - Large language model (LLM)
KW - Multi-agent reinforcement learning
KW - UAV game
UR - https://www.scopus.com/pages/publications/105034393080
U2 - 10.1016/j.ast.2026.112218
DO - 10.1016/j.ast.2026.112218
M3 - 文章
AN - SCOPUS:105034393080
SN - 1270-9638
VL - 177
JO - Aerospace Science and Technology
JF - Aerospace Science and Technology
M1 - 112218
ER -