TY - JOUR
T1 - Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites
AU - Zhang, Yizhai
AU - Wang, Wenhui
AU - Zhang, Piaoqi
AU - Huang, Panfeng
N1 - Publisher Copyright:
© 1986-2012 IEEE.
PY - 2022/6/1
Y1 - 2022/6/1
N2 - Cellular satellites, which are composed of many standard unit cells, represent a class of novel and promising satellites for future space explorations. Their potentials have been well recognized in the aerospace field. The most attractive feature of cellular satellites is the on-orbit self-reconfiguration capability through cell-by-cell moves. However, it is extremely challenging for a cellular satellite to autonomously achieve the optimal self-reconfiguration with fewest cell moves, because the search space for legal actions may be larger than that of the game of Go if the satellite has a certain number of cells. In this article, we propose a reinforcement learning-based task planning strategy for the self-reconfiguration of cellular satellites. Inspired by the recent progress of AlphaGo and AlphaGo Zero, we calculate the cell move sequence and predict the cell placements in the self-reconfiguration process by combining the Monte Carlo tree search and the neural network. The reinforcement learning-based task planning strategy is validated by comparing with the traditional melt-sort-grow algorithm. The validation results demonstrate that the proposed strategy can significantly reduce the number of cell moves for the self-reconfiguration of cellular satellites.
AB - Cellular satellites, which are composed of many standard unit cells, represent a class of novel and promising satellites for future space explorations. Their potentials have been well recognized in the aerospace field. The most attractive feature of cellular satellites is the on-orbit self-reconfiguration capability through cell-by-cell moves. However, it is extremely challenging for a cellular satellite to autonomously achieve the optimal self-reconfiguration with fewest cell moves, because the search space for legal actions may be larger than that of the game of Go if the satellite has a certain number of cells. In this article, we propose a reinforcement learning-based task planning strategy for the self-reconfiguration of cellular satellites. Inspired by the recent progress of AlphaGo and AlphaGo Zero, we calculate the cell move sequence and predict the cell placements in the self-reconfiguration process by combining the Monte Carlo tree search and the neural network. The reinforcement learning-based task planning strategy is validated by comparing with the traditional melt-sort-grow algorithm. The validation results demonstrate that the proposed strategy can significantly reduce the number of cell moves for the self-reconfiguration of cellular satellites.
UR - http://www.scopus.com/inward/record.url?scp=85122075038&partnerID=8YFLogxK
U2 - 10.1109/MAES.2021.3089252
DO - 10.1109/MAES.2021.3089252
M3 - 文章
AN - SCOPUS:85122075038
SN - 0885-8985
VL - 37
SP - 38
EP - 47
JO - IEEE Aerospace and Electronic Systems Magazine
JF - IEEE Aerospace and Electronic Systems Magazine
IS - 6
ER -