Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning

Weiren Kong; Deyun Zhou; Kai Zhang; Zhen Yang

doi:10.1109/ICCA51439.2020.9264567

Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning

Weiren Kong, Deyun Zhou, Kai Zhang, Zhen Yang

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

28 Scopus citations

Abstract

Based on a robust multi-agent reinforcement learning (MARL) algorithm framework, an autonomous maneuver decision-making algorithm for UCAV air combat in one-on-one combat in the visible range is designed and implemented. This algorithm can solve the problem that the single agent reinforcement learning algorithm cannot converge during the training process due to the unstable environment. At the same time, considering the shortcomings of the MADDPG algorithm in a strong competitive environment, it is easy to obtain a very fragile strategy, which is only targeted at a specific equilibrium strategy. In this paper, a minimax module is introduced to obtain the expected perturbation, which can locally approach the worst-case perturbation through the gradient. Through simulation tests of algorithm convergence and policy quality, the algorithm is found to be effective.

Original language	English
Title of host publication	2020 IEEE 16th International Conference on Control and Automation, ICCA 2020
Publisher	IEEE Computer Society
Pages	506-512
Number of pages	7
ISBN (Electronic)	9781728190938
DOIs	https://doi.org/10.1109/ICCA51439.2020.9264567
State	Published - 9 Oct 2020
Event	16th IEEE International Conference on Control and Automation, ICCA 2020 - Virtual, Sapporo, Hokkaido, Japan Duration: 9 Oct 2020 → 11 Oct 2020

Publication series

Name	IEEE International Conference on Control and Automation, ICCA
Volume	2020-October
ISSN (Print)	1948-3449
ISSN (Electronic)	1948-3457

Conference

Conference	16th IEEE International Conference on Control and Automation, ICCA 2020
Country/Territory	Japan
City	Virtual, Sapporo, Hokkaido
Period	9/10/20 → 11/10/20

Keywords

Air combat
Maneuver strategy
Reinforcement learning
Robust MADDPG

Access to Document

10.1109/ICCA51439.2020.9264567

Cite this

Kong, W., Zhou, D., Zhang, K., & Yang, Z. (2020). Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning. In 2020 IEEE 16th International Conference on Control and Automation, ICCA 2020 (pp. 506-512). Article 9264567 (IEEE International Conference on Control and Automation, ICCA; Vol. 2020-October). IEEE Computer Society. https://doi.org/10.1109/ICCA51439.2020.9264567

Kong, Weiren ; Zhou, Deyun ; Zhang, Kai et al. / Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning. 2020 IEEE 16th International Conference on Control and Automation, ICCA 2020. IEEE Computer Society, 2020. pp. 506-512 (IEEE International Conference on Control and Automation, ICCA).

@inproceedings{587c12218a064a3ea8d9c71c734dd1d6,

title = "Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning",

abstract = "Based on a robust multi-agent reinforcement learning (MARL) algorithm framework, an autonomous maneuver decision-making algorithm for UCAV air combat in one-on-one combat in the visible range is designed and implemented. This algorithm can solve the problem that the single agent reinforcement learning algorithm cannot converge during the training process due to the unstable environment. At the same time, considering the shortcomings of the MADDPG algorithm in a strong competitive environment, it is easy to obtain a very fragile strategy, which is only targeted at a specific equilibrium strategy. In this paper, a minimax module is introduced to obtain the expected perturbation, which can locally approach the worst-case perturbation through the gradient. Through simulation tests of algorithm convergence and policy quality, the algorithm is found to be effective.",

keywords = "Air combat, Maneuver strategy, Reinforcement learning, Robust MADDPG",

author = "Weiren Kong and Deyun Zhou and Kai Zhang and Zhen Yang",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 16th IEEE International Conference on Control and Automation, ICCA 2020 ; Conference date: 09-10-2020 Through 11-10-2020",

year = "2020",

month = oct,

day = "9",

doi = "10.1109/ICCA51439.2020.9264567",

language = "英语",

series = "IEEE International Conference on Control and Automation, ICCA",

publisher = "IEEE Computer Society",

pages = "506--512",

booktitle = "2020 IEEE 16th International Conference on Control and Automation, ICCA 2020",

}

Kong, W, Zhou, D , Zhang, K & Yang, Z 2020, Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning. in 2020 IEEE 16th International Conference on Control and Automation, ICCA 2020., 9264567, IEEE International Conference on Control and Automation, ICCA, vol. 2020-October, IEEE Computer Society, pp. 506-512, 16th IEEE International Conference on Control and Automation, ICCA 2020, Virtual, Sapporo, Hokkaido, Japan, 9/10/20. https://doi.org/10.1109/ICCA51439.2020.9264567

Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning. / Kong, Weiren; Zhou, Deyun ; Zhang, Kai et al.
2020 IEEE 16th International Conference on Control and Automation, ICCA 2020. IEEE Computer Society, 2020. p. 506-512 9264567 (IEEE International Conference on Control and Automation, ICCA; Vol. 2020-October).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning

AU - Kong, Weiren

AU - Zhou, Deyun

AU - Zhang, Kai

AU - Yang, Zhen

PY - 2020/10/9

Y1 - 2020/10/9

N2 - Based on a robust multi-agent reinforcement learning (MARL) algorithm framework, an autonomous maneuver decision-making algorithm for UCAV air combat in one-on-one combat in the visible range is designed and implemented. This algorithm can solve the problem that the single agent reinforcement learning algorithm cannot converge during the training process due to the unstable environment. At the same time, considering the shortcomings of the MADDPG algorithm in a strong competitive environment, it is easy to obtain a very fragile strategy, which is only targeted at a specific equilibrium strategy. In this paper, a minimax module is introduced to obtain the expected perturbation, which can locally approach the worst-case perturbation through the gradient. Through simulation tests of algorithm convergence and policy quality, the algorithm is found to be effective.

AB - Based on a robust multi-agent reinforcement learning (MARL) algorithm framework, an autonomous maneuver decision-making algorithm for UCAV air combat in one-on-one combat in the visible range is designed and implemented. This algorithm can solve the problem that the single agent reinforcement learning algorithm cannot converge during the training process due to the unstable environment. At the same time, considering the shortcomings of the MADDPG algorithm in a strong competitive environment, it is easy to obtain a very fragile strategy, which is only targeted at a specific equilibrium strategy. In this paper, a minimax module is introduced to obtain the expected perturbation, which can locally approach the worst-case perturbation through the gradient. Through simulation tests of algorithm convergence and policy quality, the algorithm is found to be effective.

KW - Air combat

KW - Maneuver strategy

KW - Reinforcement learning

KW - Robust MADDPG

UR - http://www.scopus.com/inward/record.url?scp=85098057782&partnerID=8YFLogxK

U2 - 10.1109/ICCA51439.2020.9264567

DO - 10.1109/ICCA51439.2020.9264567

M3 - 会议稿件

AN - SCOPUS:85098057782

T3 - IEEE International Conference on Control and Automation, ICCA

SP - 506

EP - 512

BT - 2020 IEEE 16th International Conference on Control and Automation, ICCA 2020

PB - IEEE Computer Society

T2 - 16th IEEE International Conference on Control and Automation, ICCA 2020

Y2 - 9 October 2020 through 11 October 2020

ER -

Kong W, Zhou D , Zhang K , Yang Z. Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning. In 2020 IEEE 16th International Conference on Control and Automation, ICCA 2020. IEEE Computer Society. 2020. p. 506-512. 9264567. (IEEE International Conference on Control and Automation, ICCA). doi: 10.1109/ICCA51439.2020.9264567

Air combat autonomous maneuver decision for one-on-one within visual range engagement base on robust multi-agent reinforcement learning

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this