Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention

Xingyu Wang; Zhen Yang; Xiaoyang Li; Shiyuan Chai; Yupeng He; Deyun Zhou

doi:10.1109/ICCA62789.2024.10591840

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Although deep reinforcement learning applied to air combat has achieved good results, it still faces a series of challenges such as reward design, convergence to suboptimal solutions, and poor stability. To address these, this paper proposes a behaviour generation algorithm based on Dueling-Noisy-Multi-step DQN for air combat under escape intent. By analysing the air combat confrontation process, we extract the escape intention features and establish the corresponding reward model. To address the poor stability and slow convergence of deep reinforcement learning algorithms in large-scale state-action spaces, we propose the Dueling-Noisy-Multi-step DQN algorithm, which improves the accuracy of value function fitting while also improving exploration efficiency and network generalization. Simulation experiments comparing the proposed algorithm with other algorithms demonstrate its strong performance.
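
The abstract names three standard DQN extensions (dueling heads, noisy weights, multi-step returns) without giving details. The sketch below illustrates the textbook form of each component in plain Python; it is not the paper's implementation, and every function name, argument, and numeric value is a hypothetical chosen for illustration only.

```python
import random


def dueling_q(value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def n_step_target(rewards, bootstrap_q, gamma):
    """Multi-step TD target: discounted sum of n rewards plus
    gamma^n * max_a Q(s_{t+n}, a), where n = len(rewards)."""
    n = len(rewards)
    discounted = sum((gamma ** k) * r for k, r in enumerate(rewards))
    return discounted + (gamma ** n) * max(bootstrap_q)


def noisy_weight(mu, sigma, rng):
    """Noisy-network weight: learned mean plus learned scale times
    standard Gaussian noise, so exploration comes from the parameters."""
    return mu + sigma * rng.gauss(0.0, 1.0)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, illustrative only
    print(dueling_q(1.0, [1.0, 2.0, 3.0]))
    print(n_step_target([1.0, 1.0], [0.0, 2.0], 0.5))
    print(noisy_weight(0.3, 0.1, rng))
```

Subtracting the mean advantage keeps the value and advantage streams identifiable, and the multi-step target trades some bias for faster reward propagation; how the paper combines or tunes these pieces is not stated in the abstract.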

Original language	English
Title of host publication	2024 IEEE 18th International Conference on Control and Automation, ICCA 2024
Publisher	IEEE Computer Society
Pages	228-233
Number of pages	6
ISBN (Electronic)	9798350354409
DOIs	https://doi.org/10.1109/ICCA62789.2024.10591840
State	Published - 2024
Event	18th IEEE International Conference on Control and Automation, ICCA 2024 - Reykjavik, Iceland
Duration	18 Jun 2024 → 21 Jun 2024

Publication series

Name	IEEE International Conference on Control and Automation, ICCA
ISSN (Print)	1948-3449
ISSN (Electronic)	1948-3457

Conference

Conference	18th IEEE International Conference on Control and Automation, ICCA 2024
Country/Territory	Iceland
City	Reykjavik
Period	18/06/24 → 21/06/24

Access to Document

10.1109/ICCA62789.2024.10591840

Cite this

Wang, X., Yang, Z., Li, X., Chai, S., He, Y., & Zhou, D. (2024). Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention. In 2024 IEEE 18th International Conference on Control and Automation, ICCA 2024 (pp. 228-233). (IEEE International Conference on Control and Automation, ICCA). IEEE Computer Society. https://doi.org/10.1109/ICCA62789.2024.10591840

@inproceedings{ec573bd0cec9487e831b03d4f93fb79e,

title = "Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention",

abstract = "Although deep reinforcement learning applied to air combat has achieved good results, it still faces a series of challenges such as reward design, convergence to suboptimal solutions, and poor stability. To address these, this paper proposes a behaviour generation algorithm based on Dueling-Noisy-Multi-step DQN for air combat under escape intent. By analysing the air combat confrontation process, we extract the escape intention features and establish the corresponding reward model. To address the poor stability and slow convergence of deep reinforcement learning algorithms in large-scale state-action spaces, we propose the Dueling-Noisy-Multi-step DQN algorithm, which improves the accuracy of value function fitting while also improving exploration efficiency and network generalization. Simulation experiments comparing the proposed algorithm with other algorithms demonstrate its strong performance.",

author = "Xingyu Wang and Zhen Yang and Xiaoyang Li and Shiyuan Chai and Yupeng He and Deyun Zhou",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 18th IEEE International Conference on Control and Automation, ICCA 2024 ; Conference date: 18-06-2024 Through 21-06-2024",

year = "2024",

doi = "10.1109/ICCA62789.2024.10591840",

language = "English",

series = "IEEE International Conference on Control and Automation, ICCA",

publisher = "IEEE Computer Society",

pages = "228--233",

booktitle = "2024 IEEE 18th International Conference on Control and Automation, ICCA 2024",

}

Wang, X, Yang, Z, Li, X, Chai, S, He, Y & Zhou, D 2024, Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention. in 2024 IEEE 18th International Conference on Control and Automation, ICCA 2024. IEEE International Conference on Control and Automation, ICCA, IEEE Computer Society, pp. 228-233, 18th IEEE International Conference on Control and Automation, ICCA 2024, Reykjavik, Iceland, 18/06/24. https://doi.org/10.1109/ICCA62789.2024.10591840

Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention. / Wang, Xingyu; Yang, Zhen; Li, Xiaoyang et al.
2024 IEEE 18th International Conference on Control and Automation, ICCA 2024. IEEE Computer Society, 2024. p. 228-233 (IEEE International Conference on Control and Automation, ICCA).


TY - GEN

T1 - Deep Reinforcement Learning-based Behaviour Generation Algorithm for Air Combat Escape Intention

AU - Wang, Xingyu

AU - Yang, Zhen

AU - Li, Xiaoyang

AU - Chai, Shiyuan

AU - He, Yupeng

AU - Zhou, Deyun

PY - 2024

Y1 - 2024

AB - Although deep reinforcement learning applied to air combat has achieved good results, it still faces a series of challenges such as reward design, convergence to suboptimal solutions, and poor stability. To address these, this paper proposes a behaviour generation algorithm based on Dueling-Noisy-Multi-step DQN for air combat under escape intent. By analysing the air combat confrontation process, we extract the escape intention features and establish the corresponding reward model. To address the poor stability and slow convergence of deep reinforcement learning algorithms in large-scale state-action spaces, we propose the Dueling-Noisy-Multi-step DQN algorithm, which improves the accuracy of value function fitting while also improving exploration efficiency and network generalization. Simulation experiments comparing the proposed algorithm with other algorithms demonstrate its strong performance.

UR - http://www.scopus.com/inward/record.url?scp=85200390545&partnerID=8YFLogxK

U2 - 10.1109/ICCA62789.2024.10591840

DO - 10.1109/ICCA62789.2024.10591840

M3 - Conference contribution

AN - SCOPUS:85200390545

T3 - IEEE International Conference on Control and Automation, ICCA

SP - 228

EP - 233

BT - 2024 IEEE 18th International Conference on Control and Automation, ICCA 2024

PB - IEEE Computer Society

T2 - 18th IEEE International Conference on Control and Automation, ICCA 2024

Y2 - 18 June 2024 through 21 June 2024

ER -
