Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning

Weiren Kong, Deyun Zhou, Kai Zhang, Zhen Yang, Wansha Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Scopus citations

Abstract

We present an approach for learning a reactive maneuver strategy for a UCAV formation involved in a short-range multi-UCAV air combat engagement. Specifically, we define an efficient state representation, which breaks down the complexity caused by the large state space in a multi-UCAV air combat engagement. Then a parameter sharing dueling deep Q-network (PS-DDQN) algorithm is proposed to train the UCAV formation. The learning reactive maneuver strategy is shared among our UCAVs to encourage cooperative behaviors. In addition, curriculum learning and self-play extend the maneuver strategy to more difficult scenarios. Thus, speeding up the training process and improving the learning effect. Finally, the effectiveness of the algorithm and the intelligence degree of maneuver strategy is verified by the simulation test of convergence and maneuver strategy quality.

Original languageEnglish
Title of host publicationProceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020
EditorsM. Arif Wani, Feng Luo, Xiaolin Li, Dejing Dou, Francesco Bonchi
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1174-1181
Number of pages8
ISBN (Electronic)9781728184708
DOIs
StatePublished - Dec 2020
Event19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 - Virtual, Miami, United States
Duration: 14 Dec 202017 Dec 2020

Publication series

NameProceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020

Conference

Conference19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020
Country/TerritoryUnited States
CityVirtual, Miami
Period14/12/2017/12/20

Keywords

  • air combat
  • curriculum learning
  • Multi-UCAV
  • reinforcement learning
  • training simulations

Fingerprint

Dive into the research topics of 'Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning'. Together they form a unique fingerprint.

Cite this