TY - JOUR
T1 - Online Optimal Attitude Stabilization Via Reinforcement Learning for Rigid Spacecraft With Dynamic Uncertainty
AU - Luo, Chengfeng
AU - Ning, Xin
AU - Tang, Rugang
N1 - Publisher Copyright:
© 1965-2011 IEEE.
PY - 2025
Y1 - 2025
N2 - In this paper, we discuss an online reinforcement learning (RL) algorithm to solve the optimal control problem for rigid spacecraft attitude control systems with dynamic uncertainty. The RL algorithm adapts in real time, finding the optimal control policy estimation while guaranteeing the stability of the closed-loop system and the algorithm convergence. To address dynamic uncertainty, we introduce a two-phase learning structure implementing recursive computations based on the measurable system state, rather than relying on prior knowledge of the system's dynamic model. A sufficient condition for the algorithm convergence is presented, ensuring that the control policy converges to the optimal controller within finite iterations of the learning process. Comparative simulations are conducted to illustrate the validity and advantages of the proposed algorithm.
AB - In this paper, we discuss an online reinforcement learning (RL) algorithm to solve the optimal control problem for rigid spacecraft attitude control systems with dynamic uncertainty. The RL algorithm adapts in real time, finding the optimal control policy estimation while guaranteeing the stability of the closed-loop system and the algorithm convergence. To address dynamic uncertainty, we introduce a two-phase learning structure implementing recursive computations based on the measurable system state, rather than relying on prior knowledge of the system's dynamic model. A sufficient condition for the algorithm convergence is presented, ensuring that the control policy converges to the optimal controller within finite iterations of the learning process. Comparative simulations are conducted to illustrate the validity and advantages of the proposed algorithm.
KW - Attitude control
KW - optimal control
KW - persistence of excitation
KW - reinforcement learning
KW - system identification
UR - http://www.scopus.com/inward/record.url?scp=105003652055&partnerID=8YFLogxK
U2 - 10.1109/TAES.2025.3564579
DO - 10.1109/TAES.2025.3564579
M3 - 文章
AN - SCOPUS:105003652055
SN - 0018-9251
JO - IEEE Transactions on Aerospace and Electronic Systems
JF - IEEE Transactions on Aerospace and Electronic Systems
ER -