基于知识辅助深度强化学习的巡飞弹组动态突防决策

Hao Sun, Haiqing Li, Yan Liang, Chaoxiong Ma, Han Wu

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

The loitering munition group penetration control decision (LMGPCD) is the key to improve the autonomy and intelligence of loitering munition group combat. A knowledge-assisted reinforcement learning-based LMGPCD algorithm is proposed to solve the issue due to the difficult online generation of penetration maneuver command for loitering munition group in the dynamic environment containing interceptors and air defenses. The state space and reward function are improved by domain knowledge and rule knowledge to enhance the generalization ability and training convergence speed of the algorithm. A LMGPCD decision framework based on the soft actor-critic (SAC) algorithm is constructed to increase the exploration efficiency of the algorithm. An expert experience applying and imitation learning method is utilized against the lacking of initial efficient training experience for the algorithm due to the narrow solution space caused by increasing number of missiles and threats. The experimental results show that the proposed algorithm can generate more effective penetration maneuver command in real time in a dynamic environment compared to other algorithm, which verifies the effectiveness of the proposed algorithm.

投稿的翻译标题Dynamic Penetration Decision of Loitering Munition Group Based on Knowledge-assisted Reinforcement Learning
源语言繁体中文
页(从-至)3161-3176
页数16
期刊Binggong Xuebao/Acta Armamentarii
45
9
DOI
出版状态已出版 - 30 9月 2024

关键词

  • control decision
  • dynamic environment penetration
  • knowledge-assisted deep reinforcement learning
  • loitering munition group
  • soft actor-critic algorithm

指纹

探究 '基于知识辅助深度强化学习的巡飞弹组动态突防决策' 的科研主题。它们共同构成独一无二的指纹。

引用此