Autonomous maneuver strategy of swarm air combat based on DDPG

Luhe Wang; Jinwen Hu; Zhao Xu; Chunhui Zhao

doi:10.1007/s43684-021-00013-z

Autonomous maneuver strategy of swarm air combat based on DDPG

Luhe Wang, Jinwen Hu, Zhao Xu, Chunhui Zhao

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

24 引用（Scopus）

摘要

Unmanned aerial vehicles (UAVs) have been found significantly important in the air combats, where intelligent and swarms of UAVs will be able to tackle with the tasks of high complexity and dynamics. The key to empower the UAVs with such capability is the autonomous maneuver decision making. In this paper, an autonomous maneuver strategy of UAV swarms in beyond visual range air combat based on reinforcement learning is proposed. First, based on the process of air combat and the constraints of the swarm, the motion model of UAV and the multi-to-one air combat model are established. Second, a two-stage maneuver strategy based on air combat principles is designed which include inter-vehicle collaboration and target-vehicle confrontation. Then, a swarm air combat algorithm based on deep deterministic policy gradient strategy (DDPG) is proposed for online strategy training. Finally, the effectiveness of the proposed algorithm is validated by multi-scene simulations. The results show that the algorithm is suitable for UAV swarms of different scales.

源语言	英语
文章编号	15
期刊	Autonomous Intelligent Systems
卷	1
期	1
DOI	https://doi.org/10.1007/s43684-021-00013-z
出版状态	已出版 - 12月 2021

访问文件

10.1007/s43684-021-00013-z

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{3dfe266e7eb34070a4ef3d51c7b1b17e,

title = "Autonomous maneuver strategy of swarm air combat based on DDPG",

abstract = "Unmanned aerial vehicles (UAVs) have been found significantly important in the air combats, where intelligent and swarms of UAVs will be able to tackle with the tasks of high complexity and dynamics. The key to empower the UAVs with such capability is the autonomous maneuver decision making. In this paper, an autonomous maneuver strategy of UAV swarms in beyond visual range air combat based on reinforcement learning is proposed. First, based on the process of air combat and the constraints of the swarm, the motion model of UAV and the multi-to-one air combat model are established. Second, a two-stage maneuver strategy based on air combat principles is designed which include inter-vehicle collaboration and target-vehicle confrontation. Then, a swarm air combat algorithm based on deep deterministic policy gradient strategy (DDPG) is proposed for online strategy training. Finally, the effectiveness of the proposed algorithm is validated by multi-scene simulations. The results show that the algorithm is suitable for UAV swarms of different scales.",

keywords = "Cooperative air combat, Deep reinforcement learning, Maneuver strategy, Swarm",

author = "Luhe Wang and Jinwen Hu and Zhao Xu and Chunhui Zhao",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s).",

year = "2021",

month = dec,

doi = "10.1007/s43684-021-00013-z",

language = "英语",

volume = "1",

journal = "Autonomous Intelligent Systems",

issn = "2730-616X",

publisher = "Springer",

number = "1",

}

TY - JOUR

T1 - Autonomous maneuver strategy of swarm air combat based on DDPG

AU - Wang, Luhe

AU - Hu, Jinwen

AU - Xu, Zhao

AU - Zhao, Chunhui

PY - 2021/12

Y1 - 2021/12

N2 - Unmanned aerial vehicles (UAVs) have been found significantly important in the air combats, where intelligent and swarms of UAVs will be able to tackle with the tasks of high complexity and dynamics. The key to empower the UAVs with such capability is the autonomous maneuver decision making. In this paper, an autonomous maneuver strategy of UAV swarms in beyond visual range air combat based on reinforcement learning is proposed. First, based on the process of air combat and the constraints of the swarm, the motion model of UAV and the multi-to-one air combat model are established. Second, a two-stage maneuver strategy based on air combat principles is designed which include inter-vehicle collaboration and target-vehicle confrontation. Then, a swarm air combat algorithm based on deep deterministic policy gradient strategy (DDPG) is proposed for online strategy training. Finally, the effectiveness of the proposed algorithm is validated by multi-scene simulations. The results show that the algorithm is suitable for UAV swarms of different scales.

AB - Unmanned aerial vehicles (UAVs) have been found significantly important in the air combats, where intelligent and swarms of UAVs will be able to tackle with the tasks of high complexity and dynamics. The key to empower the UAVs with such capability is the autonomous maneuver decision making. In this paper, an autonomous maneuver strategy of UAV swarms in beyond visual range air combat based on reinforcement learning is proposed. First, based on the process of air combat and the constraints of the swarm, the motion model of UAV and the multi-to-one air combat model are established. Second, a two-stage maneuver strategy based on air combat principles is designed which include inter-vehicle collaboration and target-vehicle confrontation. Then, a swarm air combat algorithm based on deep deterministic policy gradient strategy (DDPG) is proposed for online strategy training. Finally, the effectiveness of the proposed algorithm is validated by multi-scene simulations. The results show that the algorithm is suitable for UAV swarms of different scales.

KW - Cooperative air combat

KW - Deep reinforcement learning

KW - Maneuver strategy

KW - Swarm

UR - http://www.scopus.com/inward/record.url?scp=85123991707&partnerID=8YFLogxK

U2 - 10.1007/s43684-021-00013-z

DO - 10.1007/s43684-021-00013-z

M3 - 文章

AN - SCOPUS:85123991707

SN - 2730-616X

VL - 1

JO - Autonomous Intelligent Systems

JF - Autonomous Intelligent Systems

IS - 1

M1 - 15

ER -

Autonomous maneuver strategy of swarm air combat based on DDPG

摘要

访问文件

其它文件与链接

指纹

引用此