基 于 可 解 释 性 强 化 学 习 的 空 战 机 动 决 策 方 法

Translated title of the contribution: Decision-making method for air combat maneuver based on explainable reinforcement learning

Shuheng Yang, Dong Zhang, Wei Xiong, Zhi Ren, Shuo Tang

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Intelligent air combat is the trend of air combat in the future,and deep reinforcement learning is an impor- tant technical way to realize intelligent decision-making in air combat. However,due to the characteristic of“black box model”,deep reinforcement learning has the shortcomings such as difficulty in explaining strategies,understanding in- tentions,and trusting decisions,which brings challenges to the application of deep reinforcement learning in intelligent air combat. To solve these problems,an intelligent air combat maneuver decision-making method is proposed based on explainable reinforcement learning. Firstly,based on the strategy-level explanation method and dynamic Bayesian network,an interpretability model and the maneuvering intention recognition model are constructed. Secondly,through calculation of the importance of the decision and the probability of maneuvering intention,the intention-level of the Unmanned Aerial Vehicle(UAV)maneuver decision-making process can be explained. Finally,based on the in- tent interpretation results,the reward function and training strategy of the deep reinforcement learning algorithm are modified,and the effectiveness of the proposed method is verified by simulation and comparative analysis. The pro- posed method can obtain air combat maneuver strategies with excellent effectiveness,strong reliability,and high credibility.

Translated title of the contributionDecision-making method for air combat maneuver based on explainable reinforcement learning
Original languageChinese (Traditional)
Article number329922
JournalHangkong Xuebao/Acta Aeronautica et Astronautica Sinica
Volume45
Issue number18
DOIs
StatePublished - 25 Sep 2024

Fingerprint

Dive into the research topics of 'Decision-making method for air combat maneuver based on explainable reinforcement learning'. Together they form a unique fingerprint.

Cite this