Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning

Weiren Kong; Deyun Zhou; Kai Zhang; Zhen Yang; Wansha Yang

doi:10.1109/ICMLA51294.2020.00238

Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning

Weiren Kong, Deyun Zhou, Kai Zhang, Zhen Yang, Wansha Yang

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

10 引用（Scopus）

摘要

We present an approach for learning a reactive maneuver strategy for a UCAV formation involved in a short-range multi-UCAV air combat engagement. Specifically, we define an efficient state representation, which breaks down the complexity caused by the large state space in a multi-UCAV air combat engagement. Then a parameter sharing dueling deep Q-network (PS-DDQN) algorithm is proposed to train the UCAV formation. The learning reactive maneuver strategy is shared among our UCAVs to encourage cooperative behaviors. In addition, curriculum learning and self-play extend the maneuver strategy to more difficult scenarios. Thus, speeding up the training process and improving the learning effect. Finally, the effectiveness of the algorithm and the intelligence degree of maneuver strategy is verified by the simulation test of convergence and maneuver strategy quality.

源语言	英语
主期刊名	Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020
编辑	M. Arif Wani, Feng Luo, Xiaolin Li, Dejing Dou, Francesco Bonchi
出版商	Institute of Electrical and Electronics Engineers Inc.
页	1174-1181
页数	8
ISBN（电子版）	9781728184708
DOI	https://doi.org/10.1109/ICMLA51294.2020.00238
出版状态	已出版 - 12月 2020
活动	19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 - Virtual, Miami, 美国期限: 14 12月 2020 → 17 12月 2020

出版系列

姓名	Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020

会议

会议	19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020
国家/地区	美国
市	Virtual, Miami
时期	14/12/20 → 17/12/20

访问文件

10.1109/ICMLA51294.2020.00238

其它文件与链接

链接到 Scopus 的出版物

引用此

Kong, W., Zhou, D., Zhang, K., Yang, Z., & Yang, W. (2020). Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning. 在 M. A. Wani, F. Luo, X. Li, D. Dou, & F. Bonchi (编辑), Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 (页码 1174-1181). 文章 9356234 (Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICMLA51294.2020.00238

Kong, Weiren ; Zhou, Deyun ; Zhang, Kai 等. / Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning. Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020. 编辑 / M. Arif Wani ; Feng Luo ; Xiaolin Li ; Dejing Dou ; Francesco Bonchi. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 1174-1181 (Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020).

@inproceedings{c1310bec8d5f4e8d8cc54e1de4c25851,

title = "Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning",

abstract = "We present an approach for learning a reactive maneuver strategy for a UCAV formation involved in a short-range multi-UCAV air combat engagement. Specifically, we define an efficient state representation, which breaks down the complexity caused by the large state space in a multi-UCAV air combat engagement. Then a parameter sharing dueling deep Q-network (PS-DDQN) algorithm is proposed to train the UCAV formation. The learning reactive maneuver strategy is shared among our UCAVs to encourage cooperative behaviors. In addition, curriculum learning and self-play extend the maneuver strategy to more difficult scenarios. Thus, speeding up the training process and improving the learning effect. Finally, the effectiveness of the algorithm and the intelligence degree of maneuver strategy is verified by the simulation test of convergence and maneuver strategy quality.",

keywords = "air combat, curriculum learning, Multi-UCAV, reinforcement learning, training simulations",

author = "Weiren Kong and Deyun Zhou and Kai Zhang and Zhen Yang and Wansha Yang",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020 ; Conference date: 14-12-2020 Through 17-12-2020",

year = "2020",

month = dec,

doi = "10.1109/ICMLA51294.2020.00238",

language = "英语",

series = "Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1174--1181",

editor = "Wani, {M. Arif} and Feng Luo and Xiaolin Li and Dejing Dou and Francesco Bonchi",

booktitle = "Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020",

}

Kong, W, Zhou, D , Zhang, K , Yang, Z & Yang, W 2020, Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning. 在 MA Wani, F Luo, X Li, D Dou & F Bonchi (编辑), Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020., 9356234, Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020, Institute of Electrical and Electronics Engineers Inc., 页码 1174-1181, 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020, Virtual, Miami, 美国, 14/12/20. https://doi.org/10.1109/ICMLA51294.2020.00238

Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning. / Kong, Weiren; Zhou, Deyun ; Zhang, Kai 等.
Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020. 编辑 / M. Arif Wani; Feng Luo; Xiaolin Li; Dejing Dou; Francesco Bonchi. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 1174-1181 9356234 (Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning

AU - Kong, Weiren

AU - Zhou, Deyun

AU - Zhang, Kai

AU - Yang, Zhen

AU - Yang, Wansha

PY - 2020/12

Y1 - 2020/12

N2 - We present an approach for learning a reactive maneuver strategy for a UCAV formation involved in a short-range multi-UCAV air combat engagement. Specifically, we define an efficient state representation, which breaks down the complexity caused by the large state space in a multi-UCAV air combat engagement. Then a parameter sharing dueling deep Q-network (PS-DDQN) algorithm is proposed to train the UCAV formation. The learning reactive maneuver strategy is shared among our UCAVs to encourage cooperative behaviors. In addition, curriculum learning and self-play extend the maneuver strategy to more difficult scenarios. Thus, speeding up the training process and improving the learning effect. Finally, the effectiveness of the algorithm and the intelligence degree of maneuver strategy is verified by the simulation test of convergence and maneuver strategy quality.

AB - We present an approach for learning a reactive maneuver strategy for a UCAV formation involved in a short-range multi-UCAV air combat engagement. Specifically, we define an efficient state representation, which breaks down the complexity caused by the large state space in a multi-UCAV air combat engagement. Then a parameter sharing dueling deep Q-network (PS-DDQN) algorithm is proposed to train the UCAV formation. The learning reactive maneuver strategy is shared among our UCAVs to encourage cooperative behaviors. In addition, curriculum learning and self-play extend the maneuver strategy to more difficult scenarios. Thus, speeding up the training process and improving the learning effect. Finally, the effectiveness of the algorithm and the intelligence degree of maneuver strategy is verified by the simulation test of convergence and maneuver strategy quality.

KW - air combat

KW - curriculum learning

KW - Multi-UCAV

KW - reinforcement learning

KW - training simulations

UR - http://www.scopus.com/inward/record.url?scp=85102487176&partnerID=8YFLogxK

U2 - 10.1109/ICMLA51294.2020.00238

DO - 10.1109/ICMLA51294.2020.00238

M3 - 会议稿件

AN - SCOPUS:85102487176

T3 - Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020

SP - 1174

EP - 1181

BT - Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020

A2 - Wani, M. Arif

A2 - Luo, Feng

A2 - Li, Xiaolin

A2 - Dou, Dejing

A2 - Bonchi, Francesco

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020

Y2 - 14 December 2020 through 17 December 2020

ER -

Kong W, Zhou D , Zhang K , Yang Z, Yang W. Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning. 在 Wani MA, Luo F, Li X, Dou D, Bonchi F, 编辑, Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020. Institute of Electrical and Electronics Engineers Inc. 2020. 页码 1174-1181. 9356234. (Proceedings - 19th IEEE International Conference on Machine Learning and Applications, ICMLA 2020). doi: 10.1109/ICMLA51294.2020.00238

Multi-UCAV Air Combat in Short-Range Maneuver Strategy Generation using Reinforcement Learning and Curriculum Learning

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此