Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention

Xingyu Wang, Zhen Yang, Xiaoyang Li, Shiyuan Chai, Yupeng He, Deyun Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Although deep reinforcement learning applied to air combat has achieved good results, it still faces a series of challenges such as reward design, convergence of suboptimal solutions, and poor stability. In this regard, this paper proposes a behaviour generation algorithm based on Dueling-Noisy-Multi-step DQN for air combat under escape intent. By analysing the air combat confrontation process, we extract the escape intention features and establish the corresponding reward model; for the problem of poor stability and slow convergence of deep reinforcement learning algorithms in large-scale state-action space, we propose the Dueling-Noisy-Multi-step DQN algorithm, which improves the accuracy of the value function fitting and at the same time increases the efficiency of spatial exploration and network generalization. Comparison with other algorithms through simulation experiments, the results reflect the excellent performance of this paper's algorithm.

Original languageEnglish
Title of host publication2024 IEEE 18th International Conference on Control and Automation, ICCA 2024
PublisherIEEE Computer Society
Pages228-233
Number of pages6
ISBN (Electronic)9798350354409
DOIs
StatePublished - 2024
Event18th IEEE International Conference on Control and Automation, ICCA 2024 - Reykjavik, Iceland
Duration: 18 Jun 202421 Jun 2024

Publication series

NameIEEE International Conference on Control and Automation, ICCA
ISSN (Print)1948-3449
ISSN (Electronic)1948-3457

Conference

Conference18th IEEE International Conference on Control and Automation, ICCA 2024
Country/TerritoryIceland
CityReykjavik
Period18/06/2421/06/24

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention'. Together they form a unique fingerprint.

Cite this