UAV Swarm Confrontation Based on Multi-Agent Soft Actor-Critic Method

Yongkang Jiao, Wenxing Fu, Xinying Cao, Yaping Wang, Pengfei Xu, Yusheng Wang, Lanlin Yu, Haibo Du

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The UAV swarm confrontation learning environment faces complex challenges due to its high dimensionality, nonlinearity, incomplete information, and continuous action space.To address these issues, this study proposes a multi-agent soft actor-critic (MASAC) deep reinforcement learning method based on incomplete information. Built on the centralized training-distributed execution (CTDE) framework,the proposed method establishes a UAV swarm confrontation game model and simulates a multi-UAV combat environment in continuous space. Simulation results demonstrate that the MASAC method outperforms existing multi-agent deep reinforcement learning techniques in terms of convergence speed and stability. The results of this study under score the practicality and effectiveness of the MASAC method in enabling intelligent decision-making for UAV swarms, thereby offering essential technical support for future advancements in UAV operations.

Original languageEnglish
Title of host publicationProceedings of 2024 IEEE International Conference on Unmanned Systems, ICUS 2024
EditorsRong Song
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages878-883
Number of pages6
ISBN (Electronic)9798350384185
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Unmanned Systems, ICUS 2024 - Nanjing, China
Duration: 18 Oct 202420 Oct 2024

Publication series

NameProceedings of 2024 IEEE International Conference on Unmanned Systems, ICUS 2024

Conference

Conference2024 IEEE International Conference on Unmanned Systems, ICUS 2024
Country/TerritoryChina
CityNanjing
Period18/10/2420/10/24

Keywords

  • deep reinforcement learning
  • game theory
  • multi-agent systems

Fingerprint

Dive into the research topics of 'UAV Swarm Confrontation Based on Multi-Agent Soft Actor-Critic Method'. Together they form a unique fingerprint.

Cite this