Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient

Xuanyu Luo; Chuang Liu; Xiaokui Yue; Chenhao Ouyang

doi:10.52202/078367-0084

Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient

Xuanyu Luo, Chuang Liu, Xiaokui Yue, Chenhao Ouyang

School of Astronautics

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper proposes a multi-pursuer multi-target encirclement approach based on multi-agent deep deterministic policy gradient (MADDPG) algorithm, within the context of orbital game for satellite swarms. First, the multiconstraint impulsive orbital game model between the satellite swarm and non-cooperative targets is established using the CW equation and game theory. Then, through analyzing the orbital game process and integrating the Markov decision process (MDP), the MDP model between the multi-pursuer and multi-target encirclement game is developed. A corresponding reward function is designed and the training process of the network based on the MADDPG algorithm is examined for the orbital game mission. Finally, the MADDPG algorithm is applied to solve a typical multi-target orbital game problem, and comparisons with traditional numerical algorithms are performed, which demonstrates the effectiveness and feasibility in multi-target encirclement game for satellite swarms.

Original language	English
Title of host publication	IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024
Publisher	International Astronautical Federation, IAF
Pages	785-791
Number of pages	7
ISBN (Electronic)	9798331312183
DOIs	https://doi.org/10.52202/078367-0084
State	Published - 2024
Event	2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024 - Milan, Italy Duration: 14 Oct 2024 → 18 Oct 2024

Publication series

Name	Proceedings of the International Astronautical Congress, IAC
ISSN (Print)	0074-1795

Conference

Conference	2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024
Country/Territory	Italy
City	Milan
Period	14/10/24 → 18/10/24

Keywords

encirclement control
MADDPG algorithm
non-cooperative target
Satellite swarm

Access to Document

10.52202/078367-0084

Cite this

Luo, X., Liu, C., Yue, X., & Ouyang, C. (2024). Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient. In IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024 (pp. 785-791). (Proceedings of the International Astronautical Congress, IAC). International Astronautical Federation, IAF. https://doi.org/10.52202/078367-0084

Luo, Xuanyu ; Liu, Chuang ; Yue, Xiaokui et al. / Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient. IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024. International Astronautical Federation, IAF, 2024. pp. 785-791 (Proceedings of the International Astronautical Congress, IAC).

@inproceedings{97906b8994774cbca17862b519b2f039,

title = "Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient",

abstract = "This paper proposes a multi-pursuer multi-target encirclement approach based on multi-agent deep deterministic policy gradient (MADDPG) algorithm, within the context of orbital game for satellite swarms. First, the multiconstraint impulsive orbital game model between the satellite swarm and non-cooperative targets is established using the CW equation and game theory. Then, through analyzing the orbital game process and integrating the Markov decision process (MDP), the MDP model between the multi-pursuer and multi-target encirclement game is developed. A corresponding reward function is designed and the training process of the network based on the MADDPG algorithm is examined for the orbital game mission. Finally, the MADDPG algorithm is applied to solve a typical multi-target orbital game problem, and comparisons with traditional numerical algorithms are performed, which demonstrates the effectiveness and feasibility in multi-target encirclement game for satellite swarms.",

keywords = "encirclement control, MADDPG algorithm, non-cooperative target, Satellite swarm",

author = "Xuanyu Luo and Chuang Liu and Xiaokui Yue and Chenhao Ouyang",

note = "Publisher Copyright: Copyright {\textcopyright} 2024 by the International Astronautical Federation (IAF). All rights reserved.; 2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024 ; Conference date: 14-10-2024 Through 18-10-2024",

year = "2024",

doi = "10.52202/078367-0084",

language = "英语",

series = "Proceedings of the International Astronautical Congress, IAC",

publisher = "International Astronautical Federation, IAF",

pages = "785--791",

booktitle = "IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024",

}

Luo, X, Liu, C, Yue, X & Ouyang, C 2024, Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient. in IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024. Proceedings of the International Astronautical Congress, IAC, International Astronautical Federation, IAF, pp. 785-791, 2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024, Milan, Italy, 14/10/24. https://doi.org/10.52202/078367-0084

Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient. / Luo, Xuanyu; Liu, Chuang; Yue, Xiaokui et al.
IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024. International Astronautical Federation, IAF, 2024. p. 785-791 (Proceedings of the International Astronautical Congress, IAC).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient

AU - Luo, Xuanyu

AU - Liu, Chuang

AU - Yue, Xiaokui

AU - Ouyang, Chenhao

PY - 2024

Y1 - 2024

N2 - This paper proposes a multi-pursuer multi-target encirclement approach based on multi-agent deep deterministic policy gradient (MADDPG) algorithm, within the context of orbital game for satellite swarms. First, the multiconstraint impulsive orbital game model between the satellite swarm and non-cooperative targets is established using the CW equation and game theory. Then, through analyzing the orbital game process and integrating the Markov decision process (MDP), the MDP model between the multi-pursuer and multi-target encirclement game is developed. A corresponding reward function is designed and the training process of the network based on the MADDPG algorithm is examined for the orbital game mission. Finally, the MADDPG algorithm is applied to solve a typical multi-target orbital game problem, and comparisons with traditional numerical algorithms are performed, which demonstrates the effectiveness and feasibility in multi-target encirclement game for satellite swarms.

AB - This paper proposes a multi-pursuer multi-target encirclement approach based on multi-agent deep deterministic policy gradient (MADDPG) algorithm, within the context of orbital game for satellite swarms. First, the multiconstraint impulsive orbital game model between the satellite swarm and non-cooperative targets is established using the CW equation and game theory. Then, through analyzing the orbital game process and integrating the Markov decision process (MDP), the MDP model between the multi-pursuer and multi-target encirclement game is developed. A corresponding reward function is designed and the training process of the network based on the MADDPG algorithm is examined for the orbital game mission. Finally, the MADDPG algorithm is applied to solve a typical multi-target orbital game problem, and comparisons with traditional numerical algorithms are performed, which demonstrates the effectiveness and feasibility in multi-target encirclement game for satellite swarms.

KW - encirclement control

KW - MADDPG algorithm

KW - non-cooperative target

KW - Satellite swarm

UR - http://www.scopus.com/inward/record.url?scp=85218468997&partnerID=8YFLogxK

U2 - 10.52202/078367-0084

DO - 10.52202/078367-0084

M3 - 会议稿件

AN - SCOPUS:85218468997

T3 - Proceedings of the International Astronautical Congress, IAC

SP - 785

EP - 791

BT - IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024

PB - International Astronautical Federation, IAF

T2 - 2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024

Y2 - 14 October 2024 through 18 October 2024

ER -

Luo X, Liu C, Yue X, Ouyang C. Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient. In IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024. International Astronautical Federation, IAF. 2024. p. 785-791. (Proceedings of the International Astronautical Congress, IAC). doi: 10.52202/078367-0084

Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this