Dual-UAVs Maneuvering Strategy Generation Algorithm Based on Cooperative Reward Mechanism and MATD3

Jiazhen Wang, Zhen Yang, Shiyuan Chai, Weiyu Huo, Deyun Zhou

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Scopus citations

Abstract

To solve the cooperative maneuvering decision problem for UAVs in dual-UAV formations in air combat, this paper proposes an air combat maneuvering algorithm based on a cooperative reward mechanism and distributed Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3). First, the reward function is designed according to the combat objective of dual-UAV air combat. Second, to address the sparse-reward problem in air combat, a cooperative reward mechanism, based on the idea of cooperative combat in real air combat, is introduced into the reward function, and a variable-weight superposition method based on the optimal combat distance is introduced into the calculation of the immediate reward to reshape the reward function. Dual-UAV formation confrontation training is then conducted in simulation under the MATD3 framework. The simulation results show that introducing the cooperative reward mechanism and the combat distance influence factor makes the generated dual-UAV cooperative air combat maneuver strategy reasonable and more effective.
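The abstract does not give the reward formulas, so the following Python sketch only illustrates the general idea of distance-dependent weighting and a shared team term. The Gaussian weight, the parameter values (`d_opt`, `sigma`, `coop_coeff`), and all function names are assumptions chosen for illustration and are not taken from the paper.

```python
import math


def distance_weight(d, d_opt, sigma):
    """Weight in (0, 1] that peaks at 1.0 when d equals d_opt.

    The Gaussian shape is an assumption; the paper's weighting is not
    specified in the abstract.
    """
    return math.exp(-((d - d_opt) ** 2) / (2.0 * sigma ** 2))


def shaped_reward(angle_reward, distance_reward, d, d_opt=1000.0, sigma=500.0):
    """Blend two immediate-reward terms with a distance-dependent weight.

    Near the assumed optimal distance the angle term dominates; far from
    it the distance term dominates. d_opt and sigma are hypothetical.
    """
    w = distance_weight(d, d_opt, sigma)
    return w * angle_reward + (1.0 - w) * distance_reward


def cooperative_reward(own_reward, teammate_reward, coop_coeff=0.3):
    """Mix an agent's own reward with its teammate's reward.

    coop_coeff is a hypothetical mixing coefficient in [0, 1].
    """
    return (1.0 - coop_coeff) * own_reward + coop_coeff * teammate_reward


if __name__ == "__main__":
    r1 = shaped_reward(angle_reward=0.8, distance_reward=0.2, d=1200.0)
    r2 = shaped_reward(angle_reward=0.1, distance_reward=0.6, d=3000.0)
    print(cooperative_reward(r1, r2), cooperative_reward(r2, r1))
```

In a sketch like this, each agent's training signal depends partly on its wingman's situation, which is one simple way to make an otherwise sparse reward denser and to encourage cooperative behavior.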

Original language: English
Title of host publication: 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 86-91
Number of pages: 6
ISBN (Electronic): 9798350315684
State: Published - 2023
Event: 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023 - Hybrid, Grimstad, Norway
Duration: 1 Nov 2023 to 3 Nov 2023

Publication series

Name: 2023 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023

Conference

Conference: 11th International Conference on Control, Mechatronics and Automation, ICCMA 2023
Country/Territory: Norway
City: Hybrid, Grimstad
Period: 1/11/23 to 3/11/23

Keywords

  • air-combat maneuvering decisions
  • collaborative reward mechanism
  • dual-UAVs
  • MATD3
