Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient

Xuanyu Luo, Chuang Liu, Xiaokui Yue, Chenhao Ouyang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a multi-pursuer multi-target encirclement approach based on multi-agent deep deterministic policy gradient (MADDPG) algorithm, within the context of orbital game for satellite swarms. First, the multiconstraint impulsive orbital game model between the satellite swarm and non-cooperative targets is established using the CW equation and game theory. Then, through analyzing the orbital game process and integrating the Markov decision process (MDP), the MDP model between the multi-pursuer and multi-target encirclement game is developed. A corresponding reward function is designed and the training process of the network based on the MADDPG algorithm is examined for the orbital game mission. Finally, the MADDPG algorithm is applied to solve a typical multi-target orbital game problem, and comparisons with traditional numerical algorithms are performed, which demonstrates the effectiveness and feasibility in multi-target encirclement game for satellite swarms.

Original languageEnglish
Title of host publicationIAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024
PublisherInternational Astronautical Federation, IAF
Pages785-791
Number of pages7
ISBN (Electronic)9798331312183
DOIs
StatePublished - 2024
Event2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024 - Milan, Italy
Duration: 14 Oct 202418 Oct 2024

Publication series

NameProceedings of the International Astronautical Congress, IAC
ISSN (Print)0074-1795

Conference

Conference2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024
Country/TerritoryItaly
CityMilan
Period14/10/2418/10/24

Keywords

  • encirclement control
  • MADDPG algorithm
  • non-cooperative target
  • Satellite swarm

Fingerprint

Dive into the research topics of 'Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient'. Together they form a unique fingerprint.

Cite this