A swarm-independent behaviors-based orbit maneuvering approach for target-attacker-defender games of satellites

Hanyu Qian; Zhaoyue Chen; Xin Wang; Bing Xiao; Ling Meng; Yanan Ma

doi:10.1016/j.ins.2024.121790

School of Automation

Research output: Contribution to journal › Article › peer-review

Abstract

The target-attacker-defender gaming decision problem for satellites with impulse-thrust orbit maneuvering capability only is studied in this paper. A swarm-independent behaviors-based orbit maneuvering approach is proposed. The satellite maneuvering game problem is first transformed into an optimization problem involving impulse size, maneuvering type, and task objectives. A deep reinforcement learning algorithm is employed to optimize this problem. Specifically, eight swarm-independent behaviors are proposed to guide pulse size selection, involving at least 12 parameters related to the initial orbital states of both sides. Additionally, three auxiliary guidance mechanisms are introduced to reduce the optimization space. Finally, fast, autonomous, and stable game maneuvering is achieved. Unlike the distance-based approaches, the proposed method uses process guidance, incorporating more gaming information and constraints. This leads to a more precise training objective and improved training accuracy. Simulation results show that the success rates of the proposed method are over 11% higher than those achieved by distance-based methods in six versus two target-attacker-defender games.

Original language	English
Article number	121790
Journal	Information Sciences
Volume	699
DOI	https://doi.org/10.1016/j.ins.2024.121790
Publication status	Published - May 2025

Cite this

@article{cf0dde85967d417694cf786b00490595,

title = "A swarm-independent behaviors-based orbit maneuvering approach for target-attacker-defender games of satellites",

abstract = "The target-attacker-defender gaming decision problem for satellites with impulse-thrust orbit maneuvering capability only is studied in this paper. A swarm-independent behaviors-based orbit maneuvering approach is proposed. The satellite maneuvering game problem is first transformed into an optimization problem involving impulse size, maneuvering type, and task objectives. A deep reinforcement learning algorithm is employed to optimize this problem. Specifically, eight swarm-independent behaviors are proposed to guide pulse size selection, involving at least 12 parameters related to the initial orbital states of both sides. Additionally, three auxiliary guidance mechanisms are introduced to reduce the optimization space. Finally, fast, autonomous, and stable game maneuvering is achieved. Unlike the distance-based approaches, the proposed method uses process guidance, incorporating more gaming information and constraints. This leads to a more precise training objective and improved training accuracy. Simulation results show that the success rates of the proposed method are over 11% higher than those achieved by distance-based methods in six versus two target-attacker-defender games.",

keywords = "Deep reinforcement learning, Orbit maneuvering, Pursuit-evasion game, Satellite swarm, Swarm-independent behavior, Target-attacker-defender game",

author = "Hanyu Qian and Zhaoyue Chen and Xin Wang and Bing Xiao and Ling Meng and Yanan Ma",

note = "Publisher Copyright: {\textcopyright} 2024",

year = "2025",

month = may,

doi = "10.1016/j.ins.2024.121790",

language = "English",

volume = "699",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - A swarm-independent behaviors-based orbit maneuvering approach for target-attacker-defender games of satellites

AU - Qian, Hanyu

AU - Chen, Zhaoyue

AU - Wang, Xin

AU - Xiao, Bing

AU - Meng, Ling

AU - Ma, Yanan

PY - 2025/5

Y1 - 2025/5

N2 - The target-attacker-defender gaming decision problem for satellites with impulse-thrust orbit maneuvering capability only is studied in this paper. A swarm-independent behaviors-based orbit maneuvering approach is proposed. The satellite maneuvering game problem is first transformed into an optimization problem involving impulse size, maneuvering type, and task objectives. A deep reinforcement learning algorithm is employed to optimize this problem. Specifically, eight swarm-independent behaviors are proposed to guide pulse size selection, involving at least 12 parameters related to the initial orbital states of both sides. Additionally, three auxiliary guidance mechanisms are introduced to reduce the optimization space. Finally, fast, autonomous, and stable game maneuvering is achieved. Unlike the distance-based approaches, the proposed method uses process guidance, incorporating more gaming information and constraints. This leads to a more precise training objective and improved training accuracy. Simulation results show that the success rates of the proposed method are over 11% higher than those achieved by distance-based methods in six versus two target-attacker-defender games.

AB - The target-attacker-defender gaming decision problem for satellites with impulse-thrust orbit maneuvering capability only is studied in this paper. A swarm-independent behaviors-based orbit maneuvering approach is proposed. The satellite maneuvering game problem is first transformed into an optimization problem involving impulse size, maneuvering type, and task objectives. A deep reinforcement learning algorithm is employed to optimize this problem. Specifically, eight swarm-independent behaviors are proposed to guide pulse size selection, involving at least 12 parameters related to the initial orbital states of both sides. Additionally, three auxiliary guidance mechanisms are introduced to reduce the optimization space. Finally, fast, autonomous, and stable game maneuvering is achieved. Unlike the distance-based approaches, the proposed method uses process guidance, incorporating more gaming information and constraints. This leads to a more precise training objective and improved training accuracy. Simulation results show that the success rates of the proposed method are over 11% higher than those achieved by distance-based methods in six versus two target-attacker-defender games.

KW - Deep reinforcement learning

KW - Orbit maneuvering

KW - Pursuit-evasion game

KW - Satellite swarm

KW - Swarm-independent behavior

KW - Target-attacker-defender game

UR - http://www.scopus.com/inward/record.url?scp=85213055515&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2024.121790

DO - 10.1016/j.ins.2024.121790

M3 - Article

AN - SCOPUS:85213055515

SN - 0020-0255

VL - 699

JO - Information Sciences

JF - Information Sciences

M1 - 121790

ER -
