Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3

Jiazhen Wang; Zhen Yang; Shiyuan Chai; Weiyu Huo; Deyun Zhou

doi:10.1109/ICCMA59762.2023.10374675

Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3

Jiazhen Wang, Zhen Yang, Shiyuan Chai, Weiyu Huo, Deyun Zhou

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

In order to solve the cooperative maneuvering decision problem of UAVs in dual-UAVs formations in air combat, this paper proposes an air combat maneuvering algorithm based on a cooperative reward mechanism and a distributed Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3). Firstly, the reward function is designed according to the combat purpose of dual-UAVs air combat. Secondly, To address the sparse reward function problem in air combat, a cooperative reward mechanism is introduced in the reward function based on the idea of cooperative combat in real air combat, and a variable weight superposition method based on the optimal combat distance is introduced in the calculation of immediate reward to reshape the reward function. The dual-UAVs formation confrontation simulation training is conducted under the framework of MATD3 algorithm. The simulation results show that the generated dual-UAVs cooperative air combat maneuver strategy is reasonable and more effective by introducing the collaborative reward mechanism and the combat distance influence factor.

Original language	English
Title of host publication	2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	86-91
Number of pages	6
ISBN (Electronic)	9798350315684
DOIs	https://doi.org/10.1109/ICCMA59762.2023.10374675
State	Published - 2023
Event	11th International Conference on Control, Mechatronics and Automation, ICCMA 2023 - Hybrid, Grimstad, Norway Duration: 1 Nov 2023 → 3 Nov 2023

Publication series

Name	2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023

Conference

Conference	11th International Conference on Control, Mechatronics and Automation, ICCMA 2023
Country/Territory	Norway
City	Hybrid, Grimstad
Period	1/11/23 → 3/11/23

Keywords

air-combat maneuvering decisions
collaborative reward mechanism
dual-UAVs
MATD3

Access to Document

10.1109/ICCMA59762.2023.10374675

Cite this

Wang, J., Yang, Z., Chai, S., Huo, W., & Zhou, D. (2023). Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3. In 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023 (pp. 86-91). (2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICCMA59762.2023.10374675

Wang, Jiazhen ; Yang, Zhen ; Chai, Shiyuan et al. / Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3. 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023. Institute of Electrical and Electronics Engineers Inc., 2023. pp. 86-91 (2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023).

@inproceedings{7d85f2e8fff64a479fc769f26b98c89f,

title = "Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3",

abstract = "In order to solve the cooperative maneuvering decision problem of UAVs in dual-UAVs formations in air combat, this paper proposes an air combat maneuvering algorithm based on a cooperative reward mechanism and a distributed Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3). Firstly, the reward function is designed according to the combat purpose of dual-UAVs air combat. Secondly, To address the sparse reward function problem in air combat, a cooperative reward mechanism is introduced in the reward function based on the idea of cooperative combat in real air combat, and a variable weight superposition method based on the optimal combat distance is introduced in the calculation of immediate reward to reshape the reward function. The dual-UAVs formation confrontation simulation training is conducted under the framework of MATD3 algorithm. The simulation results show that the generated dual-UAVs cooperative air combat maneuver strategy is reasonable and more effective by introducing the collaborative reward mechanism and the combat distance influence factor.",

keywords = "air-combat maneuvering decisions, collaborative reward mechanism, dual-UAVs, MATD3",

author = "Jiazhen Wang and Zhen Yang and Shiyuan Chai and Weiyu Huo and Deyun Zhou",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023 ; Conference date: 01-11-2023 Through 03-11-2023",

year = "2023",

doi = "10.1109/ICCMA59762.2023.10374675",

language = "英语",

series = "2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "86--91",

booktitle = "2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023",

}

Wang, J, Yang, Z, Chai, S, Huo, W & Zhou, D 2023, Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3. in 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023. 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023, Institute of Electrical and Electronics Engineers Inc., pp. 86-91, 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023, Hybrid, Grimstad, Norway, 1/11/23. https://doi.org/10.1109/ICCMA59762.2023.10374675

Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3. / Wang, Jiazhen; Yang, Zhen; Chai, Shiyuan et al.
2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023. Institute of Electrical and Electronics Engineers Inc., 2023. p. 86-91 (2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3

AU - Wang, Jiazhen

AU - Yang, Zhen

AU - Chai, Shiyuan

AU - Huo, Weiyu

AU - Zhou, Deyun

PY - 2023

Y1 - 2023

N2 - In order to solve the cooperative maneuvering decision problem of UAVs in dual-UAVs formations in air combat, this paper proposes an air combat maneuvering algorithm based on a cooperative reward mechanism and a distributed Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3). Firstly, the reward function is designed according to the combat purpose of dual-UAVs air combat. Secondly, To address the sparse reward function problem in air combat, a cooperative reward mechanism is introduced in the reward function based on the idea of cooperative combat in real air combat, and a variable weight superposition method based on the optimal combat distance is introduced in the calculation of immediate reward to reshape the reward function. The dual-UAVs formation confrontation simulation training is conducted under the framework of MATD3 algorithm. The simulation results show that the generated dual-UAVs cooperative air combat maneuver strategy is reasonable and more effective by introducing the collaborative reward mechanism and the combat distance influence factor.

AB - In order to solve the cooperative maneuvering decision problem of UAVs in dual-UAVs formations in air combat, this paper proposes an air combat maneuvering algorithm based on a cooperative reward mechanism and a distributed Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3). Firstly, the reward function is designed according to the combat purpose of dual-UAVs air combat. Secondly, To address the sparse reward function problem in air combat, a cooperative reward mechanism is introduced in the reward function based on the idea of cooperative combat in real air combat, and a variable weight superposition method based on the optimal combat distance is introduced in the calculation of immediate reward to reshape the reward function. The dual-UAVs formation confrontation simulation training is conducted under the framework of MATD3 algorithm. The simulation results show that the generated dual-UAVs cooperative air combat maneuver strategy is reasonable and more effective by introducing the collaborative reward mechanism and the combat distance influence factor.

KW - air-combat maneuvering decisions

KW - collaborative reward mechanism

KW - dual-UAVs

KW - MATD3

UR - http://www.scopus.com/inward/record.url?scp=85183588174&partnerID=8YFLogxK

U2 - 10.1109/ICCMA59762.2023.10374675

DO - 10.1109/ICCMA59762.2023.10374675

M3 - 会议稿件

AN - SCOPUS:85183588174

T3 - 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023

SP - 86

EP - 91

BT - 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023

Y2 - 1 November 2023 through 3 November 2023

ER -

Wang J, Yang Z, Chai S, Huo W, Zhou D. Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3. In 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023. Institute of Electrical and Electronics Engineers Inc. 2023. p. 86-91. (2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023). doi: 10.1109/ICCMA59762.2023.10374675

Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this