TY - JOUR
T1 - 基于 AM-SAC 的无人机自主空战决策
AU - Li, Zenglin
AU - Li, Bo
AU - Bai, Shuangxia
AU - Meng, Bobo
N1 - Publisher Copyright:
© 2023 China Ordnance Society. All rights reserved.
PY - 2023/9/20
Y1 - 2023/9/20
N2 - To address the autonomous decision-making problem of unmanned aerial vehicles (UAV) in modern air combats, a maneuvering decision algorithm based on AM-SAC algorithm is proposed by combining the Attention Mechanism (AM) with Soft Actor Critic (SAC) in deep reinforcement learning. Focusing on 1V1 combat scenarios, the UAV three degree of freedom maneuvering model and the UAV close-range air combat model are established, and the missile attack zone model is built based on the relative distance and relative azimuth angle between both sides in a combat. The attention mechanism is introduced into SAC algorithm to construct the weight network, so as to realize the dynamic adjustment of the weight distribution of reward function during the training process. The simulation experiments are also designed. By comparing with SAC algorithm and testing in multiple environments with different initial situations, it is verified that the UAV air combat decision algorithm based on the AM-SAC algorithm has higher convergence speed and maneuvering stability, as well as better performance in air combat across various initial environments.
AB - To address the autonomous decision-making problem of unmanned aerial vehicles (UAV) in modern air combats, a maneuvering decision algorithm based on AM-SAC algorithm is proposed by combining the Attention Mechanism (AM) with Soft Actor Critic (SAC) in deep reinforcement learning. Focusing on 1V1 combat scenarios, the UAV three degree of freedom maneuvering model and the UAV close-range air combat model are established, and the missile attack zone model is built based on the relative distance and relative azimuth angle between both sides in a combat. The attention mechanism is introduced into SAC algorithm to construct the weight network, so as to realize the dynamic adjustment of the weight distribution of reward function during the training process. The simulation experiments are also designed. By comparing with SAC algorithm and testing in multiple environments with different initial situations, it is verified that the UAV air combat decision algorithm based on the AM-SAC algorithm has higher convergence speed and maneuvering stability, as well as better performance in air combat across various initial environments.
KW - air combat decision-making algorithm
KW - attention mechanism
KW - soft actor critic
KW - unmanned aerial vehicles
UR - http://www.scopus.com/inward/record.url?scp=85172725011&partnerID=8YFLogxK
U2 - 10.12382/bgxb.2022.0669
DO - 10.12382/bgxb.2022.0669
M3 - 文章
AN - SCOPUS:85172725011
SN - 1000-1093
VL - 44
SP - 2849
EP - 2858
JO - Binggong Xuebao/Acta Armamentarii
JF - Binggong Xuebao/Acta Armamentarii
IS - 9
ER -