Abstract
An air-to-air combat system is a complex multi-agent system (MAS) in which a large number of unmanned combat aerial vehicles learn to fight their opponents in a highly dynamic and uncertain environment. Because each individual agent has only local observability, classical multi-agent learning methods struggle to obtain effective cooperative strategies. Recently, communication mechanisms have been proposed to address the local observability issue of MAS. However, existing methods with predefined rules easily cause an exponential increase in state–action pairs, leading to high communication costs. Motivated by this, this paper designs a graph neural network based on a two-stage graph-attention mechanism to capture the key interaction relationships and communication connections between agents in complex air-to-air combat scenarios. Built on Multi-Agent Proximal Policy Optimization, an essential backbone multi-agent reinforcement learning method, the proposed hard- and soft-attention scheme dynamically adjusts the communication relationships and ad hoc network of multiple agents by cutting off unrelated interaction connections while simultaneously weighting the correlation importance between agent pairs. Finally, an experimental study in a simulation environment validates the effectiveness of the proposed method on large-scale air-to-air combat problems.
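The two-stage scheme described above can be illustrated with a minimal numpy sketch: a hard-attention stage produces a binary mask that cuts off unrelated agent pairs, and a soft-attention stage runs a masked softmax over the surviving edges to weight each remaining connection. The function and parameter names (`two_stage_attention`, `W_q`, `W_k`, `hard_threshold`) and the score-thresholding rule are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def two_stage_attention(h, W_q, W_k, hard_threshold=0.0):
    """Hard- then soft-attention over agent embeddings.

    h: (n, d) array of per-agent embeddings.
    Returns aggregated messages, soft-attention weights, and the hard mask.
    Thresholding scaled dot-product scores is an assumed stand-in for the
    paper's learned hard-attention stage.
    """
    q = h @ W_q                                 # queries, (n, d)
    k = h @ W_k                                 # keys, (n, d)
    scores = (q @ k.T) / np.sqrt(h.shape[1])    # pairwise relevance, (n, n)

    # Stage 1: hard attention -- binary mask cuts unrelated pairs.
    mask = scores > hard_threshold
    np.fill_diagonal(mask, False)               # no self-communication

    # Stage 2: soft attention -- masked softmax over surviving edges.
    neg = np.where(mask, scores, -np.inf)
    row_max = np.where(mask.any(axis=1, keepdims=True),
                       neg.max(axis=1, keepdims=True), 0.0)
    exp = np.where(mask, np.exp(scores - row_max), 0.0)
    denom = exp.sum(axis=1, keepdims=True)
    # Rows where every edge was cut receive all-zero weights (no messages).
    attn = np.divide(exp, denom, out=np.zeros_like(exp), where=denom > 0)

    return attn @ h, attn, mask
```

Pruning edges before the softmax is what keeps communication cost down: agents only aggregate messages from the neighbors the hard stage retains, rather than from all n-1 others.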
| Original language | English |
|---|---|
| Pages (from–to) | 19765–19781 |
| Number of pages | 17 |
| Journal | Neural Computing and Applications |
| Volume | 35 |
| Issue | 27 |
| DOI | |
| Publication status | Published - Sep 2023 |