基 于 自 博 弈 深 度 强 化 学 习 的 空 战 智 能 决 策 方 法

Shengzhe Shan, Weiwei Zhang

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

Air combat is an important element in the three-dimensional nature of war,and intelligent air combat has become a hotspot and focus of research in the military field both domestically and internationally. Deep reinforcement learning is an important technological approach to achieving air combat intelligence. To address the challenge of constructing high-level opponents in single agent training method,a self-play based air combat agent training method is proposed,and a visualization research platform is built to develop a decision-making agent for close-range air combat. The field knowledge of pilots is embedded in the design process of the agent’s observation,action,and reward,training the agent to convergence. Simulation experiments show that the air combat tactics of agent gradually improves by self-play training,achieving a win rate of over 70% against the decision making by single agent training and the emerging of the strategies similar to human“single/double loop”tactics.

投稿的翻译标题Air combat intelligent decision-making method based on self-play and deep reinforcement learning
源语言繁体中文
文章编号328723
期刊Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
45
4
DOI
出版状态已出版 - 25 2月 2024

关键词

  • agent
  • air combat
  • artificial intelligence
  • deep reinforcement learning
  • self-play

指纹

探究 '基 于 自 博 弈 深 度 强 化 学 习 的 空 战 智 能 决 策 方 法' 的科研主题。它们共同构成独一无二的指纹。

引用此