Abstract
In tackling the optimal safety control problem of spacecraft attitude within the presence of multiple constraint regions, this study proposes a reinforcement learning-based control algorithm rooted in non-zero-sum game theory. Initially, a multi-forbidden region model is devised to capture the spacecraft's attitude dynamics, which is subsequently mapped onto a multi-input non-zero-sum game framework. In the next step, a reinforcement learning strategy, utilizing an “Actor-critic” architecture, is employed to approximate the Nash equilibrium solution of the non-zero-sum game. The evaluation network continuously assesses the spacecraft's current control state across the multiple forbidden regions, thereby guiding the action network to minimize the evaluation network's output. This process facilitates the coordinated management of repulsive forces imposed by various forbidden regions on the spacecraft's attitude, ultimately ensuring the system achieves the Nash equilibrium and optimizes the attitude control despite the imposed constraints. Additionally, leveraging Lyapunov stability theory, the stability of the proposed control strategy is rigorously validated. Finally, simulation results substantiate the effectiveness and robustness of the approach, underscoring its potential for real-world applications within complex constrained environments.
| Original language | English |
|---|---|
| Article number | 107929 |
| Journal | Journal of the Franklin Institute |
| Volume | 362 |
| Issue number | 13 |
| DOIs | |
| State | Published - 15 Aug 2025 |
Keywords
- Multiple constraint regions
- Neural network
- Non-zero-sum game
- Reinforcement learning
- Spacecraft
Fingerprint
Dive into the research topics of 'Optimal safe control of spacecraft attitude in multi-constrained regions based on non-zero-sum game'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver