Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

Yuhe Zhang; Zhen Yang; Shiyuan Chai; Yupeng He; Xingyu Wang; Deyun Zhou

doi:10.23919/CCC58697.2023.10240246

Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

Yuhe Zhang, Zhen Yang, Shiyuan Chai, Yupeng He, Xingyu Wang, Deyun Zhou

电子信息学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

2 引用（Scopus）

摘要

Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.

源语言	英语
主期刊名	2023 42nd Chinese Control Conference, CCC 2023
出版商	IEEE Computer Society
页	3946-3953
页数	8
ISBN（电子版）	9789887581543
DOI	https://doi.org/10.23919/CCC58697.2023.10240246
出版状态	已出版 - 2023
活动	42nd Chinese Control Conference, CCC 2023 - Tianjin, 中国期限: 24 7月 2023 → 26 7月 2023

出版系列

姓名	Chinese Control Conference, CCC
卷	2023-July
ISSN（印刷版）	1934-1768
ISSN（电子版）	2161-2927

会议

会议	42nd Chinese Control Conference, CCC 2023
国家/地区	中国
市	Tianjin
时期	24/07/23 → 26/07/23

访问文件

10.23919/CCC58697.2023.10240246

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, Y., Yang, Z., Chai, S., He, Y., Wang, X., & Zhou, D. (2023). Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization. 在 2023 42nd Chinese Control Conference, CCC 2023 (页码 3946-3953). (Chinese Control Conference, CCC; 卷 2023-July). IEEE Computer Society. https://doi.org/10.23919/CCC58697.2023.10240246

@inproceedings{9bb7f03698cf4140889dc04ac2c756cc,

title = "Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization",

abstract = "Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.",

keywords = "Air Combat, Hybrid Action Space, Missile Launch Strategy, Proximal Policy Optimization, Reinforcement Learning",

author = "Yuhe Zhang and Zhen Yang and Shiyuan Chai and Yupeng He and Xingyu Wang and Deyun Zhou",

note = "Publisher Copyright: {\textcopyright} 2023 Technical Committee on Control Theory, Chinese Association of Automation.; 42nd Chinese Control Conference, CCC 2023 ; Conference date: 24-07-2023 Through 26-07-2023",

year = "2023",

doi = "10.23919/CCC58697.2023.10240246",

language = "英语",

series = "Chinese Control Conference, CCC",

publisher = "IEEE Computer Society",

pages = "3946--3953",

booktitle = "2023 42nd Chinese Control Conference, CCC 2023",

}

Zhang, Y, Yang, Z, Chai, S, He, Y, Wang, X & Zhou, D 2023, Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization. 在 2023 42nd Chinese Control Conference, CCC 2023. Chinese Control Conference, CCC, 卷 2023-July, IEEE Computer Society, 页码 3946-3953, 42nd Chinese Control Conference, CCC 2023, Tianjin, 中国, 24/07/23. https://doi.org/10.23919/CCC58697.2023.10240246

Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization. / Zhang, Yuhe; Yang, Zhen; Chai, Shiyuan 等.
2023 42nd Chinese Control Conference, CCC 2023. IEEE Computer Society, 2023. 页码 3946-3953 (Chinese Control Conference, CCC; 卷 2023-July).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

AU - Zhang, Yuhe

AU - Yang, Zhen

AU - Chai, Shiyuan

AU - He, Yupeng

AU - Wang, Xingyu

AU - Zhou, Deyun

PY - 2023

Y1 - 2023

N2 - Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.

AB - Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.

KW - Air Combat

KW - Hybrid Action Space

KW - Missile Launch Strategy

KW - Proximal Policy Optimization

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=85175562790&partnerID=8YFLogxK

U2 - 10.23919/CCC58697.2023.10240246

DO - 10.23919/CCC58697.2023.10240246

M3 - 会议稿件

AN - SCOPUS:85175562790

T3 - Chinese Control Conference, CCC

SP - 3946

EP - 3953

BT - 2023 42nd Chinese Control Conference, CCC 2023

PB - IEEE Computer Society

T2 - 42nd Chinese Control Conference, CCC 2023

Y2 - 24 July 2023 through 26 July 2023

ER -

Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此