深度强化学习的无人作战飞机空战机动决策

Yongfeng Li, Jingping Shi, Weiguo Zhang, Wei Jiang

科研成果: 期刊稿件文章同行评审

15 引用 (Scopus)

摘要

When an unmanned combat aerial vehicle (UCAV) is making the decision of autonomous maneuver in air combat, it faces large-scale calculation and is susceptible to the uncertain manipulation of the enemy. To tackle such problems, a decision-making model for autonomous maneuver of UCAV in air combat was proposed based on deep reinforcement learning algorithm in this study. With this algorithm, the UCAV can autonomously make maneuver decisions during air combat to achieve dominant position. First, based on the aircraft control system, a six-degree-of-freedom UCAV model was built using MATLAB/Simulink simulation platform, and the appropriate air combat action was selected as the maneuver output. On this basis, the decision-making model for the autonomous maneuver of UCAV in air combat was designed. Through the relative movement of both sides, the operational evaluation model was constructed. The range of the missile attack area was analyzed, and the corresponding advantage function was taken as the evaluation basis of the deep reinforcement learning. Then, the UCAV was trained by stages from the easy to the difficult, and the optimal maneuver control command was analyzed by investigating the deep Q network. Thereby, the UCAV could select corresponding maneuver actions in different situations and evaluate the battlefield situation independently, making tactical decisions and achieving the purpose of improving combat effectiveness. Simulation results suggest that the proposed method can make UCAV choose the tactical action independently in air combat and reach the dominant position quickly, which greatly improves the combat efficiency of the UCAV.

投稿的翻译标题Maneuver decision of UCAV in air combat based on deep reinforcement learning
源语言繁体中文
页(从-至)33-41
页数9
期刊Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology
53
12
DOI
出版状态已出版 - 30 12月 2021

关键词

  • Advantage function
  • Autonomous maneuver decision in air combat
  • Deep Q network
  • Deep reinforcement learning
  • Six-degree-of-freedom
  • Unmanned combat aerial vehicle (UCAV)

指纹

探究 '深度强化学习的无人作战飞机空战机动决策' 的科研主题。它们共同构成独一无二的指纹。

引用此