TY - JOUR
T1 - Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform
AU - Kong, Weiren
AU - Zhou, Deyun
AU - Du, Yongjie
AU - Zhou, Ying
AU - Zhao, Yiyang
N1 - Publisher Copyright:
© 2001-2012 IEEE.
PY - 2023/9/15
Y1 - 2023/9/15
N2 - Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.
AB - Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.
KW - Artificial intelligence (AI)
KW - autonomous air combat
KW - competitive self-play (SP)
KW - maneuver decision-making
KW - multiagent reinforcement learning (MARL)
UR - http://www.scopus.com/inward/record.url?scp=85141625656&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2022.3220324
DO - 10.1109/JSEN.2022.3220324
M3 - 文章
AN - SCOPUS:85141625656
SN - 1530-437X
VL - 23
SP - 20596
EP - 20606
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 18
ER -