基于深度强化学习与自学习的多无人机近距空战机动策略生成算法

Wei Ren Kong, De Yun Zhou, Yi Yang Zhao, Wan Sha Yang

科研成果: 期刊稿件文章同行评审

14 引用 (Scopus)

摘要

In order to solve the problem of multi-UAV close-range air combat maneuvering decision-making, a multi- UAV close-range air combat maneuvering strategy generation algorithm based on parameter sharing Q network and neural fictitious self-play is proposed. Firstly, a hybrid Markov game model suitable for different UAV formation sizes and a reinforcement learning framework for generating maneuvering decision strategies of multi-UAV are designed-parameter sharing Q network, and the state space is compressed through the autoencoder to improve the efficiency of strategy learning. Then, using the neural fictitious self-play makes the maneuver strategy converge to the Nash equilibrium strategy. Finally, simulation experiments are carried out on the parameter selection of the autoencoder, the training process of the strategy generation algorithm, and the rationality and portability of the maneuver strategy. The simulation results show that the autoencoder is introduced can effectively improve the efficiency of strategy learning, and the multi-UAV short-range air combat maneuver strategy generated by this algorithm is reasonable and good portability.

投稿的翻译标题Maneuvering strategy generation algorithm for multi-UAV in close-range air combat based on deep reinforcement learning and self-play
源语言繁体中文
页(从-至)352-362
页数11
期刊Kongzhi Lilun Yu Yingyong/Control Theory and Applications
39
2
DOI
出版状态已出版 - 2月 2022

关键词

  • Air combat decision-making
  • Fictitious self-play
  • Multi-UAV cooperation
  • Reinforcement learning

指纹

探究 '基于深度强化学习与自学习的多无人机近距空战机动策略生成算法' 的科研主题。它们共同构成独一无二的指纹。

引用此