Online Optimal Attitude Stabilization Via Reinforcement Learning for Rigid Spacecraft With Dynamic Uncertainty

Chengfeng Luo; Xin Ning; Rugang Tang

doi:10.1109/TAES.2025.3564579

Online Optimal Attitude Stabilization Via Reinforcement Learning for Rigid Spacecraft With Dynamic Uncertainty

Chengfeng Luo, Xin Ning, Rugang Tang

航天学院

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

In this paper, we discuss an online reinforcement learning (RL) algorithm to solve the optimal control problem for rigid spacecraft attitude control systems with dynamic uncertainty. The RL algorithm adapts in real time, finding the optimal control policy estimation while guaranteeing the stability of the closed-loop system and the algorithm convergence. To address dynamic uncertainty, we introduce a two-phase learning structure implementing recursive computations based on the measurable system state, rather than relying on prior knowledge of the system's dynamic model. A sufficient condition for the algorithm convergence is presented, ensuring that the control policy converges to the optimal controller within finite iterations of the learning process. Comparative simulations are conducted to illustrate the validity and advantages of the proposed algorithm.

源语言	英语
期刊	IEEE Transactions on Aerospace and Electronic Systems
DOI	https://doi.org/10.1109/TAES.2025.3564579
出版状态	已接受/待刊 - 2025

访问文件

10.1109/TAES.2025.3564579

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{c8a1976f35504efe9ca631a352e4c730,

title = "Online Optimal Attitude Stabilization Via Reinforcement Learning for Rigid Spacecraft With Dynamic Uncertainty",

abstract = "In this paper, we discuss an online reinforcement learning (RL) algorithm to solve the optimal control problem for rigid spacecraft attitude control systems with dynamic uncertainty. The RL algorithm adapts in real time, finding the optimal control policy estimation while guaranteeing the stability of the closed-loop system and the algorithm convergence. To address dynamic uncertainty, we introduce a two-phase learning structure implementing recursive computations based on the measurable system state, rather than relying on prior knowledge of the system's dynamic model. A sufficient condition for the algorithm convergence is presented, ensuring that the control policy converges to the optimal controller within finite iterations of the learning process. Comparative simulations are conducted to illustrate the validity and advantages of the proposed algorithm.",

keywords = "Attitude control, optimal control, persistence of excitation, reinforcement learning, system identification",

author = "Chengfeng Luo and Xin Ning and Rugang Tang",

note = "Publisher Copyright: {\textcopyright} 1965-2011 IEEE.",

year = "2025",

doi = "10.1109/TAES.2025.3564579",

language = "英语",

journal = "IEEE Transactions on Aerospace and Electronic Systems",

issn = "0018-9251",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Online Optimal Attitude Stabilization Via Reinforcement Learning for Rigid Spacecraft With Dynamic Uncertainty

AU - Luo, Chengfeng

AU - Ning, Xin

AU - Tang, Rugang

PY - 2025

Y1 - 2025

N2 - In this paper, we discuss an online reinforcement learning (RL) algorithm to solve the optimal control problem for rigid spacecraft attitude control systems with dynamic uncertainty. The RL algorithm adapts in real time, finding the optimal control policy estimation while guaranteeing the stability of the closed-loop system and the algorithm convergence. To address dynamic uncertainty, we introduce a two-phase learning structure implementing recursive computations based on the measurable system state, rather than relying on prior knowledge of the system's dynamic model. A sufficient condition for the algorithm convergence is presented, ensuring that the control policy converges to the optimal controller within finite iterations of the learning process. Comparative simulations are conducted to illustrate the validity and advantages of the proposed algorithm.

AB - In this paper, we discuss an online reinforcement learning (RL) algorithm to solve the optimal control problem for rigid spacecraft attitude control systems with dynamic uncertainty. The RL algorithm adapts in real time, finding the optimal control policy estimation while guaranteeing the stability of the closed-loop system and the algorithm convergence. To address dynamic uncertainty, we introduce a two-phase learning structure implementing recursive computations based on the measurable system state, rather than relying on prior knowledge of the system's dynamic model. A sufficient condition for the algorithm convergence is presented, ensuring that the control policy converges to the optimal controller within finite iterations of the learning process. Comparative simulations are conducted to illustrate the validity and advantages of the proposed algorithm.

KW - Attitude control

KW - optimal control

KW - persistence of excitation

KW - reinforcement learning

KW - system identification

UR - http://www.scopus.com/inward/record.url?scp=105003652055&partnerID=8YFLogxK

U2 - 10.1109/TAES.2025.3564579

DO - 10.1109/TAES.2025.3564579

M3 - 文章

AN - SCOPUS:105003652055

SN - 0018-9251

JO - IEEE Transactions on Aerospace and Electronic Systems

JF - IEEE Transactions on Aerospace and Electronic Systems

ER -

Online Optimal Attitude Stabilization Via Reinforcement Learning for Rigid Spacecraft With Dynamic Uncertainty

摘要

访问文件

其它文件与链接

指纹

引用此