Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning

Qiming Yang; Jiandong Zhang; Guoqing Shi; Jinwen Hu; Yong Wu

doi:10.1109/ACCESS.2019.2961426

Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning

Qiming Yang, Jiandong Zhang, Guoqing Shi, Jinwen Hu, Yong Wu

机电学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

164 引用（Scopus）

摘要

With the development of artificial intelligence and integrated sensor technologies, unmanned aerial vehicles (UAVs) are more and more applied in the air combats. A bottleneck that constrains the capability of UAVs against manned vehicles is the autonomous maneuver decision, which is a very challenging problem in the short-range air combat undergoing highly dynamic and uncertain maneuvers of enemies. In this paper, an autonomous maneuver decision model is proposed for the UAV short-range air combat based on reinforcement learning, which mainly includes the aircraft motion model, one-to-one short-range air combat evaluation model and the maneuver decision model based on deep Q network (DQN). However, such model includes a high dimensional state and action space which requires huge computation load for DQN training using traditional methods. Then, a phased training method, called 'basic-confrontation', which is based on the idea that human beings gradually learn from simple to complex is proposed to help reduce the training time while getting suboptimal but efficient results. Finally, one-to-one short-range air combats are simulated under different target maneuver policies. Simulation results show that the proposed maneuver decision model and training method can help the UAV achieve autonomous decision in the air combats and obtain an effective decision policy to defeat the opponent.

源语言	英语
文章编号	8938773
页（从-至）	363-378
页数	16
期刊	IEEE Access
卷	8
DOI	https://doi.org/10.1109/ACCESS.2019.2961426
出版状态	已出版 - 2020

访问文件

10.1109/ACCESS.2019.2961426

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{640c570f2b5741d3b32a88f9a4513afa,

title = "Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning",

abstract = "With the development of artificial intelligence and integrated sensor technologies, unmanned aerial vehicles (UAVs) are more and more applied in the air combats. A bottleneck that constrains the capability of UAVs against manned vehicles is the autonomous maneuver decision, which is a very challenging problem in the short-range air combat undergoing highly dynamic and uncertain maneuvers of enemies. In this paper, an autonomous maneuver decision model is proposed for the UAV short-range air combat based on reinforcement learning, which mainly includes the aircraft motion model, one-to-one short-range air combat evaluation model and the maneuver decision model based on deep Q network (DQN). However, such model includes a high dimensional state and action space which requires huge computation load for DQN training using traditional methods. Then, a phased training method, called 'basic-confrontation', which is based on the idea that human beings gradually learn from simple to complex is proposed to help reduce the training time while getting suboptimal but efficient results. Finally, one-to-one short-range air combats are simulated under different target maneuver policies. Simulation results show that the proposed maneuver decision model and training method can help the UAV achieve autonomous decision in the air combats and obtain an effective decision policy to defeat the opponent.",

keywords = "Deep reinforcement learning, deep Q network, independent decision, maneuver decision, network training",

author = "Qiming Yang and Jiandong Zhang and Guoqing Shi and Jinwen Hu and Yong Wu",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2020",

doi = "10.1109/ACCESS.2019.2961426",

language = "英语",

volume = "8",

pages = "363--378",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning

AU - Yang, Qiming

AU - Zhang, Jiandong

AU - Shi, Guoqing

AU - Hu, Jinwen

AU - Wu, Yong

PY - 2020

Y1 - 2020

N2 - With the development of artificial intelligence and integrated sensor technologies, unmanned aerial vehicles (UAVs) are more and more applied in the air combats. A bottleneck that constrains the capability of UAVs against manned vehicles is the autonomous maneuver decision, which is a very challenging problem in the short-range air combat undergoing highly dynamic and uncertain maneuvers of enemies. In this paper, an autonomous maneuver decision model is proposed for the UAV short-range air combat based on reinforcement learning, which mainly includes the aircraft motion model, one-to-one short-range air combat evaluation model and the maneuver decision model based on deep Q network (DQN). However, such model includes a high dimensional state and action space which requires huge computation load for DQN training using traditional methods. Then, a phased training method, called 'basic-confrontation', which is based on the idea that human beings gradually learn from simple to complex is proposed to help reduce the training time while getting suboptimal but efficient results. Finally, one-to-one short-range air combats are simulated under different target maneuver policies. Simulation results show that the proposed maneuver decision model and training method can help the UAV achieve autonomous decision in the air combats and obtain an effective decision policy to defeat the opponent.

AB - With the development of artificial intelligence and integrated sensor technologies, unmanned aerial vehicles (UAVs) are more and more applied in the air combats. A bottleneck that constrains the capability of UAVs against manned vehicles is the autonomous maneuver decision, which is a very challenging problem in the short-range air combat undergoing highly dynamic and uncertain maneuvers of enemies. In this paper, an autonomous maneuver decision model is proposed for the UAV short-range air combat based on reinforcement learning, which mainly includes the aircraft motion model, one-to-one short-range air combat evaluation model and the maneuver decision model based on deep Q network (DQN). However, such model includes a high dimensional state and action space which requires huge computation load for DQN training using traditional methods. Then, a phased training method, called 'basic-confrontation', which is based on the idea that human beings gradually learn from simple to complex is proposed to help reduce the training time while getting suboptimal but efficient results. Finally, one-to-one short-range air combats are simulated under different target maneuver policies. Simulation results show that the proposed maneuver decision model and training method can help the UAV achieve autonomous decision in the air combats and obtain an effective decision policy to defeat the opponent.

KW - Deep reinforcement learning

KW - deep Q network

KW - independent decision

KW - maneuver decision

KW - network training

UR - http://www.scopus.com/inward/record.url?scp=85078705518&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2019.2961426

DO - 10.1109/ACCESS.2019.2961426

M3 - 文章

AN - SCOPUS:85078705518

SN - 2169-3536

VL - 8

SP - 363

EP - 378

JO - IEEE Access

JF - IEEE Access

M1 - 8938773

ER -

Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning

摘要

访问文件

其它文件与链接

指纹

引用此