基于启发强化学习的大规模ADR任务优化方法

Jianan Yang, Xiaolei Hou, Yu Hen Hu, Yong Liu, Quan Pan, Qian Feng

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

Vigorous development of the space industry leads to a nonnegligible space debris threat to future space activities. The Active multi-Debris Removal (ADR) technology has become an indispensable means to alleviate this situation. Aiming at the large-scale multi-debris active removal mission planning problem, a Reinforcement Learning (RL) planning scheme is first proposed based on the maximal-reward optimization model for the ADR problem, and the state, action, and reward function of this problem are defined according to the RL framework. Based on an efficient heuristics method, a specialized Monte Carlo Tree Search (MCTS) algorithm is then presented, with the Monte Carlo Tree Search as the core structure and efficient heuristic operators and reinforcement learning iteration process. Finally, its effectiveness is tested in the large-scale complete Iridium 33 debris cloud. The results show that this method is superior to the original MCTS algorithm and the heuristic greedy algorithm.

投稿的翻译标题Heuristic enhanced reinforcement learning method for large-scale multi-debris active removal mission planning
源语言繁体中文
文章编号524354
期刊Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
42
4
DOI
出版状态已出版 - 25 4月 2021

关键词

  • Active debris removal
  • Heuristic operator
  • Mission planning
  • Monte Carlo tree search
  • Reinforcement learning

指纹

探究 '基于启发强化学习的大规模ADR任务优化方法' 的科研主题。它们共同构成独一无二的指纹。

引用此