TY - GEN
T1 - Multi-Pursuer Multi-Target Encirclement Strategy Based on Multi-Agent Deep Deterministic Policy Gradient
AU - Luo, Xuanyu
AU - Liu, Chuang
AU - Yue, Xiaokui
AU - Ouyang, Chenhao
N1 - Publisher Copyright: Copyright © 2024 by the International Astronautical Federation (IAF). All rights reserved.
PY - 2024
Y1 - 2024
AB - This paper proposes a multi-pursuer multi-target encirclement approach based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm, in the context of orbital games for satellite swarms. First, a multi-constraint impulsive orbital game model between the satellite swarm and non-cooperative targets is established using the Clohessy-Wiltshire (CW) equations and game theory. Then, by analyzing the orbital game process and integrating it with the Markov decision process (MDP) framework, an MDP model of the multi-pursuer multi-target encirclement game is developed. A corresponding reward function is designed, and the training process of the MADDPG-based network is examined for the orbital game mission. Finally, the MADDPG algorithm is applied to a typical multi-target orbital game problem and compared with traditional numerical algorithms; the results demonstrate the effectiveness and feasibility of the approach for multi-target encirclement games with satellite swarms.
KW - encirclement control
KW - MADDPG algorithm
KW - non-cooperative target
KW - satellite swarm
UR - http://www.scopus.com/inward/record.url?scp=85218468997&partnerID=8YFLogxK
U2 - 10.52202/078367-0084
DO - 10.52202/078367-0084
M3 - Conference contribution
AN - SCOPUS:85218468997
T3 - Proceedings of the International Astronautical Congress, IAC
SP - 785
EP - 791
BT - IAF Space Operations Symposium - Held at the 75th International Astronautical Congress, IAC 2024
PB - International Astronautical Federation, IAF
T2 - 2024 IAF Space Operations Symposium at the 75th International Astronautical Congress, IAC 2024
Y2 - 14 October 2024 through 18 October 2024
ER -