Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform

Weiren Kong; Deyun Zhou; Yongjie Du; Ying Zhou; Yiyang Zhao

doi:10.1109/JSEN.2022.3220324

Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform

Weiren Kong, Deyun Zhou, Yongjie Du, Ying Zhou, Yiyang Zhao

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.

Original language	English
Pages (from-to)	20596-20606
Number of pages	11
Journal	IEEE Sensors Journal
Volume	23
Issue number	18
DOIs	https://doi.org/10.1109/JSEN.2022.3220324
State	Published - 15 Sep 2023

Keywords

Artificial intelligence (AI)
autonomous air combat
competitive self-play (SP)
maneuver decision-making
multiagent reinforcement learning (MARL)

Access to Document

10.1109/JSEN.2022.3220324

Cite this

@article{2343bbdfcaf743be9cb44ea05dc6fb11,

title = "Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform",

abstract = "Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.",

keywords = "Artificial intelligence (AI), autonomous air combat, competitive self-play (SP), maneuver decision-making, multiagent reinforcement learning (MARL)",

author = "Weiren Kong and Deyun Zhou and Yongjie Du and Ying Zhou and Yiyang Zhao",

note = "Publisher Copyright: {\textcopyright} 2001-2012 IEEE.",

year = "2023",

month = sep,

day = "15",

doi = "10.1109/JSEN.2022.3220324",

language = "英语",

volume = "23",

pages = "20596--20606",

journal = "IEEE Sensors Journal",

issn = "1530-437X",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "18",

}

TY - JOUR

T1 - Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform

AU - Kong, Weiren

AU - Zhou, Deyun

AU - Du, Yongjie

AU - Zhou, Ying

AU - Zhao, Yiyang

PY - 2023/9/15

Y1 - 2023/9/15

N2 - Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.

AB - Autonomous air combat has received significant attention from researchers working on artificial intelligence (AI) applications. Most previous research on autonomous air combat has focused on one-on-one air combat scenarios in which air combat situational information is considered to be precisely observable. However, most modern air combats are conducted in formations, where air combat situational information is obtained from multiple sensors. Therefore, we introduce a novel automated maneuver decision architecture for close-range multiaircraft air combat scenarios under the multisensor unmanned combat aerial vehicle (UCAV) platform that can handle air combat scenarios with variable-sized formations. Then, a multiagent reinforcement learning (MARL) algorithm is proposed to obtain the strategy. The training performance of the training algorithm is evaluated, the obtained strategy is analyzed in different air combat scenarios, and it is found that these formations exhibit effective cooperative behavior in symmetric and asymmetric situations. Finally, we give ideas for the engineering implementation of a maneuver control architecture. This study provides a solution for future multiaircraft autonomous air combat.

KW - Artificial intelligence (AI)

KW - autonomous air combat

KW - competitive self-play (SP)

KW - maneuver decision-making

KW - multiagent reinforcement learning (MARL)

UR - http://www.scopus.com/inward/record.url?scp=85141625656&partnerID=8YFLogxK

U2 - 10.1109/JSEN.2022.3220324

DO - 10.1109/JSEN.2022.3220324

M3 - 文章

AN - SCOPUS:85141625656

SN - 1530-437X

VL - 23

SP - 20596

EP - 20606

JO - IEEE Sensors Journal

JF - IEEE Sensors Journal

IS - 18

ER -

Reinforcement Learning for Multiaircraft Autonomous Air Combat in Multisensor UCAV Platform

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this