Skip to main navigation Skip to search Skip to main content

Consistency Intelligence Control Method for Coordinated Pursuit of Non-Cooperative Targets in Multi-Spacecraft Systems

  • Suyi Liu
  • , Xuyang Cao
  • , Tongshu Zhang
  • , Xin Ning
  • , Xiaobin Lian
  • Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the rapid development of space technology, the space mission mode has evolved from single spacecraft global planning to multi-spacecraft formation, cluster, and constellation collaborative mode, providing a robust foundation for achieving tasks with higher complexity and larger spatial scale. Regarding the cutting-edge topic of non-cooperative target group pursuit and encirclement, although existing research has extensively explored methods for cooperative interception of spacecraft clusters in orbit, there are still two major bottlenecks in conventional planning algorithms: The strongly coupled constraints, which make it difficult to effectively handle dynamic allocation problems in concurrent multi-target pursuit-evade scenarios of multiple non cooperative targets; The traditional method of relying on ground-based measurement and control systems for target information acquisition and instruction uploading has significant time delays, making it difficult to adapt to the real-time evolution of dynamic space environments. Therefore, this study proposes a multi-objective multi-round pursuit and evasion method based on Relative Reachable Domains and Multi-Agent Deep Deterministic Policy Gradient (RRD-MADDPG) algorithm, which constructs a theoretical framework for spatial multi-agent pursuit and evasion problems. Firstly, based on high-precision orbital dynamics models and spacecraft relative motion models, a multi-round pulse-based pursuit-evasion model is established between two clusters; Secondly, by combining the realtime orbit status of both clusters, we predict the reachable domain of the trajectory within impulsive maneuver intervals, and construct a dynamic database of the Relative Reachable Domain (RRD) of the non-cooperative targets; Furthermore, an innovative”chase fill” reward mechanism is designed for the multi-agent pursuit and evasion model, which effectively mitigates the convergence problem caused by manual reward function design. Finally, the RRD-MADDPG method is numerically validated in a typical multi-agent pursuit and evasion problem. Preliminary experimental results demonstrate that, compared with the traditional MADDPG method, the proposed approach achieves faster convergence and a higher target capture completion rate, highlighting its potential advantages in dynamic space pursuit-evasion scenarios.

Original languageEnglish
Title of host publicationIAF Astrodynamics Symposium - Held at the 76th International Astronautical Congress, IAC 2025
PublisherInternational Astronautical Federation, IAF
Pages1482-1493
Number of pages12
ISBN (Electronic)9798331329358
DOIs
StatePublished - 2025
Event2025 IAF Astrodynamics Symposium at the 76th International Astronautical Congress, IAC 2025 - Sydney, Australia
Duration: 29 Sep 20253 Oct 2025

Publication series

NameProceedings of the International Astronautical Congress, IAC
Volume2-F219391
ISSN (Print)0074-1795

Conference

Conference2025 IAF Astrodynamics Symposium at the 76th International Astronautical Congress, IAC 2025
Country/TerritoryAustralia
CitySydney
Period29/09/253/10/25

Keywords

  • Multi-agent deep deterministic policy gradient
  • Multi-agent reinforcement learning
  • Pursuit-Evasion Game
  • Relative Reachable Domain
  • Spacecraft Cluster Planning

Fingerprint

Dive into the research topics of 'Consistency Intelligence Control Method for Coordinated Pursuit of Non-Cooperative Targets in Multi-Spacecraft Systems'. Together they form a unique fingerprint.

Cite this