Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

Yuhe Zhang; Zhen Yang; Shiyuan Chai; Yupeng He; Xingyu Wang; Deyun Zhou

doi:10.23919/CCC58697.2023.10240246

Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

Yuhe Zhang, Zhen Yang, Shiyuan Chai, Yupeng He, Xingyu Wang, Deyun Zhou

School of Electronics and Information

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.

Original language	English
Title of host publication	2023 42nd Chinese Control Conference, CCC 2023
Publisher	IEEE Computer Society
Pages	3946-3953
Number of pages	8
ISBN (Electronic)	9789887581543
DOIs	https://doi.org/10.23919/CCC58697.2023.10240246
State	Published - 2023
Event	42nd Chinese Control Conference, CCC 2023 - Tianjin, China Duration: 24 Jul 2023 → 26 Jul 2023

Publication series

Name	Chinese Control Conference, CCC
Volume	2023-July
ISSN (Print)	1934-1768
ISSN (Electronic)	2161-2927

Conference

Conference	42nd Chinese Control Conference, CCC 2023
Country/Territory	China
City	Tianjin
Period	24/07/23 → 26/07/23

Keywords

Air Combat
Hybrid Action Space
Missile Launch Strategy
Proximal Policy Optimization
Reinforcement Learning

Access to Document

10.23919/CCC58697.2023.10240246

Cite this

Zhang, Y., Yang, Z., Chai, S., He, Y., Wang, X., & Zhou, D. (2023). Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization. In 2023 42nd Chinese Control Conference, CCC 2023 (pp. 3946-3953). (Chinese Control Conference, CCC; Vol. 2023-July). IEEE Computer Society. https://doi.org/10.23919/CCC58697.2023.10240246

@inproceedings{9bb7f03698cf4140889dc04ac2c756cc,

title = "Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization",

abstract = "Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.",

keywords = "Air Combat, Hybrid Action Space, Missile Launch Strategy, Proximal Policy Optimization, Reinforcement Learning",

author = "Yuhe Zhang and Zhen Yang and Shiyuan Chai and Yupeng He and Xingyu Wang and Deyun Zhou",

note = "Publisher Copyright: {\textcopyright} 2023 Technical Committee on Control Theory, Chinese Association of Automation.; 42nd Chinese Control Conference, CCC 2023 ; Conference date: 24-07-2023 Through 26-07-2023",

year = "2023",

doi = "10.23919/CCC58697.2023.10240246",

language = "英语",

series = "Chinese Control Conference, CCC",

publisher = "IEEE Computer Society",

pages = "3946--3953",

booktitle = "2023 42nd Chinese Control Conference, CCC 2023",

}

Zhang, Y, Yang, Z, Chai, S, He, Y, Wang, X & Zhou, D 2023, Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization. in 2023 42nd Chinese Control Conference, CCC 2023. Chinese Control Conference, CCC, vol. 2023-July, IEEE Computer Society, pp. 3946-3953, 42nd Chinese Control Conference, CCC 2023, Tianjin, China, 24/07/23. https://doi.org/10.23919/CCC58697.2023.10240246

Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization. / Zhang, Yuhe; Yang, Zhen; Chai, Shiyuan et al.
2023 42nd Chinese Control Conference, CCC 2023. IEEE Computer Society, 2023. p. 3946-3953 (Chinese Control Conference, CCC; Vol. 2023-July).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

AU - Zhang, Yuhe

AU - Yang, Zhen

AU - Chai, Shiyuan

AU - He, Yupeng

AU - Wang, Xingyu

AU - Zhou, Deyun

PY - 2023

Y1 - 2023

N2 - Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.

AB - Reinforcement learning algorithm usually only improves maneuver strategy by the strength and weakness of the Air combat situation, but ignores the basic air combat attack task, whether the missile hits the target or not, and the hybrid action space problem caused by discrete missile launch strategy and continuous maneuver strategy. In order to solve the problem, this paper designs a reinforcement learning method based on proximal policy optimization, In this method, two separate policy networks are used to solve the hybrid action space problem caused by the discrete missile launch action and the continuous maneuver action. Whether the missile hits the target is taken as the evaluation system, and the missile launch action and maneuver action are jointly modeled. Thus complete the air combat task from the situation occupation through maneuvering action to the missile launch action guiding the missile to destroy the target. Finally, the intelligence level of the generation strategy is verified by the simulation experiment of UAV 1 versus 1 air combat attack mission under different initial situations. The results show that the maneuvering strategy and missile launching strategy generated by this algorithm are reasonable and can complete the designed air combat task.

KW - Air Combat

KW - Hybrid Action Space

KW - Missile Launch Strategy

KW - Proximal Policy Optimization

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=85175562790&partnerID=8YFLogxK

U2 - 10.23919/CCC58697.2023.10240246

DO - 10.23919/CCC58697.2023.10240246

M3 - 会议稿件

AN - SCOPUS:85175562790

T3 - Chinese Control Conference, CCC

SP - 3946

EP - 3953

BT - 2023 42nd Chinese Control Conference, CCC 2023

PB - IEEE Computer Society

T2 - 42nd Chinese Control Conference, CCC 2023

Y2 - 24 July 2023 through 26 July 2023

ER -

Maneuver and Attack Strategy Generation Method for Autonomous Air Combat in Hybrid Action Space Based on Proximal Policy Optimization

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this