TY - JOUR
T1 - Multi-UAV Interception Decision-Making via Meta Reinforcement Learning
AU - Qi, Chenyang
AU - Li, Huiping
AU - Zong, Guangdeng
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Multi-UAV collaborative interception decision-making is a critical task in uncrewed aerial vehicle (UAV) applications. The core challenge lies in achieving real-time decision-making and precise coordination among multiple agents in highly dynamic and uncertain complex environments to successfully intercept highly maneuverable targets. However, existing methods still fall short in terms of rapid adaptability for interception decision-making, and the generalization performance of traditional reinforcement learning (RL) on new tasks needs improvement. To address these limitations, this article presents a novel meta-RL-based approach for multi-UAV interception decision-making. First, a task allocation method based on greedy strategy is proposed to achieve the rapid assignment of interception targets. Then, we design a task inference model that employs a probabilistic encoder to extract the shared structure across tasks from contextual information, thereby generating a latent variable to represent the task and enhance the training of the algorithm's network. Meanwhile, the proposed method is integrated with RL to improve its generalization capability in new tasks, enabling fast decision-making for the interception task. Finally, the comparative results against baseline algorithms validated the superiority of our method.
AB - Multi-UAV collaborative interception decision-making is a critical task in uncrewed aerial vehicle (UAV) applications. The core challenge lies in achieving real-time decision-making and precise coordination among multiple agents in highly dynamic and uncertain complex environments to successfully intercept highly maneuverable targets. However, existing methods still fall short in terms of rapid adaptability for interception decision-making, and the generalization performance of traditional reinforcement learning (RL) on new tasks needs improvement. To address these limitations, this article presents a novel meta-RL-based approach for multi-UAV interception decision-making. First, a task allocation method based on greedy strategy is proposed to achieve the rapid assignment of interception targets. Then, we design a task inference model that employs a probabilistic encoder to extract the shared structure across tasks from contextual information, thereby generating a latent variable to represent the task and enhance the training of the algorithm's network. Meanwhile, the proposed method is integrated with RL to improve its generalization capability in new tasks, enabling fast decision-making for the interception task. Finally, the comparative results against baseline algorithms validated the superiority of our method.
KW - Interception decision-making
KW - meta reinforcement learning (meta-RL)
KW - multi-UAV
KW - task inference
UR - https://www.scopus.com/pages/publications/105025912490
U2 - 10.1109/TII.2025.3640714
DO - 10.1109/TII.2025.3640714
M3 - 文章
AN - SCOPUS:105025912490
SN - 1551-3203
JO - IEEE Transactions on Industrial Informatics
JF - IEEE Transactions on Industrial Informatics
ER -