深度确定性策略梯度和预测相结合的无人机空战决策研究

Yongfeng Li; Yongxi Lyu; Jingping Shi; Weihua Li

doi:10.1051/jnwpu/20234110056

Translated title of the contribution: UAV's air combat decision-making based on deep deterministic policy gradient and prediction

School of Automation

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

To address the uncertainty of enemy maneuvers in autonomous air combat maneuver decision-making for UAVs, this paper proposes a decision-making method that combines target maneuver command prediction with the deep deterministic policy gradient (DDPG) algorithm. The situation data of both sides are fused and processed, and a six-degree-of-freedom model of the UAV and a maneuver library are built. During air combat, the target generates commands from its maneuver library using a deep Q-network (DQN) algorithm, while our UAV predicts the target's maneuver using a probabilistic neural network. A DDPG reinforcement learning method that considers both the situation information of the two aircraft and the predicted enemy maneuver is then proposed, so that the UAV can choose an appropriate maneuver according to the current air combat situation. Simulation results show that the method makes effective use of the air combat situation information and the target maneuver prediction, improving the effectiveness of reinforcement learning for autonomous UAV air combat decision-making while preserving convergence.
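The abstract does not specify how the prediction result is passed to the DDPG agent. The following is a minimal sketch of one plausible arrangement, not the authors' implementation: a Parzen-window classifier in the style of a probabilistic neural network estimates a probability for each target maneuver, and those probabilities are appended to the situation vector to form the state an actor network would observe. The feature layout, the two-maneuver library, the smoothing width `sigma`, and the names `pnn_predict` and `augmented_state` are all assumptions made for illustration.

```python
import numpy as np

def pnn_predict(x, patterns, labels, n_classes, sigma=0.5):
    """Class probabilities from a Parzen-window (PNN-style) classifier.

    Hypothetical helper: the paper's actual network inputs are not given.
    """
    # Pattern layer: Gaussian kernel between the query and each stored sample.
    d2 = np.sum((patterns - x) ** 2, axis=1)
    kernel = np.exp(-d2 / (2.0 * sigma ** 2))
    # Summation layer: mean kernel response per maneuver class.
    scores = np.array([
        kernel[labels == c].mean() if np.any(labels == c) else 0.0
        for c in range(n_classes)
    ])
    total = scores.sum()
    # Fall back to a uniform distribution if every kernel underflows to zero.
    return scores / total if total > 0 else np.full(n_classes, 1.0 / n_classes)

def augmented_state(situation, maneuver_probs):
    """Concatenate situation features with the predicted maneuver distribution."""
    return np.concatenate([situation, maneuver_probs])

# Toy data (assumed): two observed target features, two maneuver classes.
patterns = np.array([[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [1.0, 0.9]])
labels = np.array([0, 0, 1, 1])
observed = np.array([0.05, 0.0])

probs = pnn_predict(observed, patterns, labels, n_classes=2)

# Assumed situation vector, e.g. normalized range, angle, and height difference.
situation = np.array([0.4, -0.2, 0.1])
state = augmented_state(situation, probs)
```

Appending the full probability vector rather than only the most likely maneuver keeps the prediction uncertainty visible to the policy; whether the paper does this is not stated in the abstract.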

Translated title of the contribution	UAV's air combat decision-making based on deep deterministic policy gradient and prediction
Original language	Chinese (Traditional)
Pages (from-to)	56-64
Number of pages	9
Journal	Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
Volume	41
Issue number	1
DOIs	https://doi.org/10.1051/jnwpu/20234110056
State	Published - Feb 2023

Cite this

@article{dc150de9e5ea4fdbbf9d259c2a0e2295,

title = "深度确定性策略梯度和预测相结合的无人机空战决策研究",

keywords = "air combat maneuver decision-making, deep deterministic policy gradient, prediction, UAV",

author = "Yongfeng Li and Yongxi Lyu and Jingping Shi and Weihua Li",

note = "Publisher Copyright: {\textcopyright}2023 Journal of Northwestern Polytechnical University.",

year = "2023",

month = feb,

doi = "10.1051/jnwpu/20234110056",

language = "Chinese (Traditional)",

volume = "41",

pages = "56--64",

journal = "Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University",

issn = "1000-2758",

publisher = "Northwestern Polytechnical University",

number = "1",

}

TY - JOUR

T1 - 深度确定性策略梯度和预测相结合的无人机空战决策研究

AU - Li, Yongfeng

AU - Lyu, Yongxi

AU - Shi, Jingping

AU - Li, Weihua

PY - 2023/2

Y1 - 2023/2

KW - air combat maneuver decision-making

KW - deep deterministic policy gradient

KW - prediction

KW - UAV

UR - http://www.scopus.com/inward/record.url?scp=85162173813&partnerID=8YFLogxK

U2 - 10.1051/jnwpu/20234110056

DO - 10.1051/jnwpu/20234110056

M3 - Article

AN - SCOPUS:85162173813

SN - 1000-2758

VL - 41

SP - 56

EP - 64

JO - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University

JF - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University

IS - 1

ER -
