深度强化学习的无人作战飞机空战机动决策

Translated title of the contribution: Maneuver decision of UCAV in air combat based on deep reinforcement learning

Yongfeng Li, Jingping Shi, Weiguo Zhang, Wei Jiang

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

When an unmanned combat aerial vehicle (UCAV) makes autonomous maneuver decisions in air combat, it faces a heavy computational load and is vulnerable to the uncertain maneuvers of the enemy. To address these problems, this study proposes a decision-making model for autonomous UCAV maneuvering in air combat based on a deep reinforcement learning algorithm. With this algorithm, the UCAV can autonomously make maneuver decisions during air combat to reach a dominant position. First, based on the aircraft control system, a six-degree-of-freedom UCAV model was built on the MATLAB/Simulink simulation platform, and appropriate air combat actions were selected as the maneuver output. On this basis, the decision-making model for autonomous UCAV maneuvering in air combat was designed. An operational evaluation model was constructed from the relative motion of the two sides. The range of the missile attack zone was analyzed, and the corresponding advantage function was used as the evaluation basis for deep reinforcement learning. The UCAV was then trained in stages, from easy to difficult, and the optimal maneuver control commands were analyzed by examining the deep Q-network. As a result, the UCAV could select appropriate maneuvers in different situations and evaluate the battlefield situation independently, making tactical decisions that improve combat effectiveness. Simulation results suggest that the proposed method enables the UCAV to choose tactical actions independently in air combat and reach a dominant position quickly, which greatly improves its combat efficiency.
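The abstract does not give the maneuver library, the advantage function, or the network details, so the following is only a minimal sketch of the general deep Q-learning ingredients it mentions, not the authors' implementation. The maneuver names in `MANEUVERS`, the angle-based form of `angle_advantage`, and all parameter values are assumptions chosen for illustration.

```python
import math
import random

# Hypothetical discrete maneuver library (the abstract does not list the actions).
MANEUVERS = ["level", "accelerate", "decelerate",
             "turn_left", "turn_right", "climb", "dive"]


def angle_advantage(antenna_train_angle, aspect_angle):
    """Illustrative angle advantage in [-1, 1] (assumed form, not from the paper).

    Both angles are in radians within [-pi, pi]; the value is 1 when the
    UCAV points at the target from directly behind it, and -1 in the
    opposite geometry.
    """
    return 1.0 - (abs(antenna_train_angle) + abs(aspect_angle)) / math.pi


def select_action(q_values, epsilon, rng):
    """Epsilon-greedy choice over Q-values, one value per maneuver."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))  # explore: random maneuver
    # exploit: index of the largest Q-value
    return max(range(len(q_values)), key=lambda i: q_values[i])


def td_target(reward, next_q_values, gamma, done):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a')."""
    if done:
        return reward  # no bootstrap term at the end of an episode
    return reward + gamma * max(next_q_values)


if __name__ == "__main__":
    rng = random.Random(0)
    q = [0.1, 0.4, 0.0, 0.9, 0.2, 0.3, 0.5]  # made-up Q-values for one state
    action = select_action(q, epsilon=0.0, rng=rng)
    print(MANEUVERS[action])
    print(td_target(angle_advantage(0.0, 0.0), q, gamma=0.5, done=False))
```

In a staged (easy-to-difficult) training scheme of the kind the abstract describes, one plausible arrangement is to reuse the same update rule across stages while making the opponent's behavior or the initial geometry progressively harder. The abstract does not specify how the stages were defined.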

Original language: Chinese (Traditional)
Pages (from-to): 33-41
Number of pages: 9
Journal: Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology
Volume: 53
Issue number: 12
State: Published - 30 Dec 2021
