Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites

Yizhai Zhang; Wenhui Wang; Piaoqi Zhang; Panfeng Huang

doi:10.1109/MAES.2021.3089252

Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites

Yizhai Zhang, Wenhui Wang, Piaoqi Zhang, Panfeng Huang

School of Astronautics

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

Cellular satellites, which are composed of many standard unit cells, represent a class of novel and promising satellites for future space explorations. Their potentials have been well recognized in the aerospace field. The most attractive feature of cellular satellites is the on-orbit self-reconfiguration capability through cell-by-cell moves. However, it is extremely challenging for a cellular satellite to autonomously achieve the optimal self-reconfiguration with fewest cell moves, because the search space for legal actions may be larger than that of the game of Go if the satellite has a certain number of cells. In this article, we propose a reinforcement learning-based task planning strategy for the self-reconfiguration of cellular satellites. Inspired by the recent progress of AlphaGo and AlphaGo Zero, we calculate the cell move sequence and predict the cell placements in the self-reconfiguration process by combining the Monte Carlo tree search and the neural network. The reinforcement learning-based task planning strategy is validated by comparing with the traditional melt-sort-grow algorithm. The validation results demonstrate that the proposed strategy can significantly reduce the number of cell moves for the self-reconfiguration of cellular satellites.

Original language	English
Pages (from-to)	38-47
Number of pages	10
Journal	IEEE Aerospace and Electronic Systems Magazine
Volume	37
Issue number	6
DOIs	https://doi.org/10.1109/MAES.2021.3089252
State	Published - 1 Jun 2022

Access to Document

10.1109/MAES.2021.3089252

Cite this

@article{faedfa1603154586bb0c4fb3c5f84a06,

title = "Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites",

abstract = "Cellular satellites, which are composed of many standard unit cells, represent a class of novel and promising satellites for future space explorations. Their potentials have been well recognized in the aerospace field. The most attractive feature of cellular satellites is the on-orbit self-reconfiguration capability through cell-by-cell moves. However, it is extremely challenging for a cellular satellite to autonomously achieve the optimal self-reconfiguration with fewest cell moves, because the search space for legal actions may be larger than that of the game of Go if the satellite has a certain number of cells. In this article, we propose a reinforcement learning-based task planning strategy for the self-reconfiguration of cellular satellites. Inspired by the recent progress of AlphaGo and AlphaGo Zero, we calculate the cell move sequence and predict the cell placements in the self-reconfiguration process by combining the Monte Carlo tree search and the neural network. The reinforcement learning-based task planning strategy is validated by comparing with the traditional melt-sort-grow algorithm. The validation results demonstrate that the proposed strategy can significantly reduce the number of cell moves for the self-reconfiguration of cellular satellites.",

author = "Yizhai Zhang and Wenhui Wang and Piaoqi Zhang and Panfeng Huang",

note = "Publisher Copyright: {\textcopyright} 1986-2012 IEEE.",

year = "2022",

month = jun,

day = "1",

doi = "10.1109/MAES.2021.3089252",

language = "英语",

volume = "37",

pages = "38--47",

journal = "IEEE Aerospace and Electronic Systems Magazine",

issn = "0885-8985",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "6",

}

TY - JOUR

T1 - Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites

AU - Zhang, Yizhai

AU - Wang, Wenhui

AU - Zhang, Piaoqi

AU - Huang, Panfeng

PY - 2022/6/1

Y1 - 2022/6/1

N2 - Cellular satellites, which are composed of many standard unit cells, represent a class of novel and promising satellites for future space explorations. Their potentials have been well recognized in the aerospace field. The most attractive feature of cellular satellites is the on-orbit self-reconfiguration capability through cell-by-cell moves. However, it is extremely challenging for a cellular satellite to autonomously achieve the optimal self-reconfiguration with fewest cell moves, because the search space for legal actions may be larger than that of the game of Go if the satellite has a certain number of cells. In this article, we propose a reinforcement learning-based task planning strategy for the self-reconfiguration of cellular satellites. Inspired by the recent progress of AlphaGo and AlphaGo Zero, we calculate the cell move sequence and predict the cell placements in the self-reconfiguration process by combining the Monte Carlo tree search and the neural network. The reinforcement learning-based task planning strategy is validated by comparing with the traditional melt-sort-grow algorithm. The validation results demonstrate that the proposed strategy can significantly reduce the number of cell moves for the self-reconfiguration of cellular satellites.

AB - Cellular satellites, which are composed of many standard unit cells, represent a class of novel and promising satellites for future space explorations. Their potentials have been well recognized in the aerospace field. The most attractive feature of cellular satellites is the on-orbit self-reconfiguration capability through cell-by-cell moves. However, it is extremely challenging for a cellular satellite to autonomously achieve the optimal self-reconfiguration with fewest cell moves, because the search space for legal actions may be larger than that of the game of Go if the satellite has a certain number of cells. In this article, we propose a reinforcement learning-based task planning strategy for the self-reconfiguration of cellular satellites. Inspired by the recent progress of AlphaGo and AlphaGo Zero, we calculate the cell move sequence and predict the cell placements in the self-reconfiguration process by combining the Monte Carlo tree search and the neural network. The reinforcement learning-based task planning strategy is validated by comparing with the traditional melt-sort-grow algorithm. The validation results demonstrate that the proposed strategy can significantly reduce the number of cell moves for the self-reconfiguration of cellular satellites.

UR - http://www.scopus.com/inward/record.url?scp=85122075038&partnerID=8YFLogxK

U2 - 10.1109/MAES.2021.3089252

DO - 10.1109/MAES.2021.3089252

M3 - 文章

AN - SCOPUS:85122075038

SN - 0885-8985

VL - 37

SP - 38

EP - 47

JO - IEEE Aerospace and Electronic Systems Magazine

JF - IEEE Aerospace and Electronic Systems Magazine

IS - 6

ER -

Reinforcement-Learning-Based Task Planning for Self-Reconfiguration of Cellular Satellites

Abstract

Access to Document

Other files and links

Fingerprint

Cite this