Global-and-Local Attention-Based Reinforcement Learning for Cooperative Behaviour Control of Multiple UAVs

Jinchao Chen, Tingyang Li, Ying Zhang, Tao You, Yantao Lu, Prayag Tiwari, Neeraj Kumar

科研成果: 期刊稿件文章同行评审

84 引用 (Scopus)

摘要

Due to the strong adaptability and high flexibility, unmanned aerial vehicles (UAVs) have been extensively studied and widely applied in both civil and military applications. Although UAVs can achieve significant cost reduction and performance enhancement in large-scale systems by taking full advantage of their cooperation and coordination, they result in a serious cooperative behaviour control problem. Especially in dynamic environments, the cooperative behaviour control problem which has to quickly produce a safe and effective behaviour decision for each UAV to achieve group missions, is NP-hard and difficult to settle. In this work, we design a global-and-local attention-based reinforcement learning algorithm for the cooperative behaviour control problem of UAVs. First, with the motion and coordination models, we analyze the collision avoidance, motion state update, and task execution constraints of multiple UAVs, and abstract the cooperative behaviour control problem as a multi-constraint decision-making one. Then, inspired from the human-learning process where more attention is devoted to the important parts of data, we design a multi-agent reinforcement learning algorithm with a global-and-local attention mechanism to cooperatively control the behaviours of UAVs and achieve the coordination. Simulation experiments in a multi-agent particle environment provided by OpenAI are conducted to verify the effectiveness and efficiency of the proposed approach. Compared with baselines, our approach shows significant advantages in mean reward, training time, and coordination effect.

源语言英语
页(从-至)4194-4206
页数13
期刊IEEE Transactions on Vehicular Technology
73
3
DOI
出版状态已出版 - 2024

指纹

探究 'Global-and-Local Attention-Based Reinforcement Learning for Cooperative Behaviour Control of Multiple UAVs' 的科研主题。它们共同构成独一无二的指纹。

引用此