TY - JOUR
T1 - A swarm-independent behaviors-based orbit maneuvering approach for target-attacker-defender games of satellites
AU - Qian, Hanyu
AU - Chen, Zhaoyue
AU - Wang, Xin
AU - Xiao, Bing
AU - Meng, Ling
AU - Ma, Yanan
N1 - Publisher Copyright:
© 2024
PY - 2025/5
Y1 - 2025/5
N2 - The target-attacker-defender gaming decision problem for satellites with impulse-thrust orbit maneuvering capability only is studied in this paper. A swarm-independent behaviors-based orbit maneuvering approach is proposed. The satellite maneuvering game problem is first transformed into an optimization problem involving impulse size, maneuvering type, and task objectives. A deep reinforcement learning algorithm is employed to optimize this problem. Specifically, eight swarm-independent behaviors are proposed to guide pulse size selection, involving at least 12 parameters related to the initial orbital states of both sides. Additionally, three auxiliary guidance mechanisms are introduced to reduce the optimization space. Finally, fast, autonomous, and stable game maneuvering is achieved. Unlike the distance-based approaches, the proposed method uses process guidance, incorporating more gaming information and constraints. This leads to a more precise training objective and improved training accuracy. Simulation results show that the success rates of the proposed method are over 11% higher than those achieved by distance-based methods in six versus two target-attacker-defender games.
AB - The target-attacker-defender gaming decision problem for satellites with impulse-thrust orbit maneuvering capability only is studied in this paper. A swarm-independent behaviors-based orbit maneuvering approach is proposed. The satellite maneuvering game problem is first transformed into an optimization problem involving impulse size, maneuvering type, and task objectives. A deep reinforcement learning algorithm is employed to optimize this problem. Specifically, eight swarm-independent behaviors are proposed to guide pulse size selection, involving at least 12 parameters related to the initial orbital states of both sides. Additionally, three auxiliary guidance mechanisms are introduced to reduce the optimization space. Finally, fast, autonomous, and stable game maneuvering is achieved. Unlike the distance-based approaches, the proposed method uses process guidance, incorporating more gaming information and constraints. This leads to a more precise training objective and improved training accuracy. Simulation results show that the success rates of the proposed method are over 11% higher than those achieved by distance-based methods in six versus two target-attacker-defender games.
KW - Deep reinforcement learning
KW - Orbit maneuvering
KW - Pursuit-evasion game
KW - Satellite swarm
KW - Swarm-independent behavior
KW - Target-attacker-defender game
UR - http://www.scopus.com/inward/record.url?scp=85213055515&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2024.121790
DO - 10.1016/j.ins.2024.121790
M3 - 文章
AN - SCOPUS:85213055515
SN - 0020-0255
VL - 699
JO - Information Sciences
JF - Information Sciences
M1 - 121790
ER -