摘要
To address the challenges of incomplete information,high training difficulty,and low coordination efficiency in spacecraft swarm’s cooperative round-up of non-cooperative targets,an improved multi-agent reinforcement learning-based cooperative round-up approach for spacecraft swarm is proposed. Firstly,considering the characteristics of the cooperative round-up task,the dynamic model,and the constraint conditions,a cooperative round-up model for spacecraft swarm is established. Secondly,to overcome the difficulty in converging complex policy networks,residual learning is introduced to optimize the policy network,and the residual network multi-agent proximal policy optimization (RN-MAPPO)algorithm is proposed. Furthermore,considering the unknown maneuvering capability and strategy of the target,a cooperative round-up approach for spacecraft swarm is designed based on the RN-MAPPO algorithm to achieve rapid strategy generation for multi-scenario cooperative round-up. Simulation results demonstrate that the proposed method exhibits favorable convergence and training efficiency,effectively reduces round-up time and fuel consumption,and ensures a high round-up success rate while fully leveraging the synergy of the swarm.
| 投稿的翻译标题 | RN-MAPPO-based Cooperative Round-up Method for Spacecraft Swarms |
|---|---|
| 源语言 | 繁体中文 |
| 页(从-至) | 1091-1102 |
| 页数 | 12 |
| 期刊 | Yuhang Xuebao/Journal of Astronautics |
| 卷 | 47 |
| 期 | 4 |
| DOI | |
| 出版状态 | 已出版 - 2026 |
关键词
- Cooperative round-up
- Multi-agent reinforcement learning
- Orbital game
- Residual learning
- Spacecraft swarm
指纹
探究 '基于 RN-MAPPO 的航天器集群协同围捕方法' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver