TY - JOUR
T1 - Observer-based optimized backstepping control using critic-actor reinforcement learning for morphing aircraft
AU - Cheng, Haoyu
AU - Zhang, Shuo
AU - Feng, Yuanjun
AU - Fu, Wenxing
AU - Ni, Maolin
N1 - Publisher Copyright:
© 2025 Elsevier Masson SAS
PY - 2025/9
Y1 - 2025/9
N2 - This study developed an optimized backstepping control scheme for morphing aircraft, integrating observer design with Critic-Actor reinforcement learning (RL) theory to achieve optimal control. The aerodynamic model of the morphing aircraft was first analyzed, leading to the formulation of a nonlinear dynamic model. An improved extended state observer (ESO) was designed to estimate lumped disturbances and imprecisely known system states, enhancing observation accuracy. A Critic-Actor RL network, combined with the ESO, was then implemented at each order of the backstepping design to generate optimal control commands, ensuring optimal performance while maintaining disturbance rejection. A lightweight RL framework was developed using an adaptive-law update method, enhancing practical applicability and distinguishing the approach from traditional RL methods by eliminating the need for repeated trial-and-error training. The boundedness of the closed-loop signals was proven theoretically via a Lyapunov function, ensuring the stability and safety of the controller. However, challenges remain in scenarios where the nonlinear dynamics of the aircraft are entirely unknown. Simulation results demonstrated that the proposed algorithm exhibits high robustness and stability in two simulation cases, with optimality guaranteed.
AB - This study developed an optimized backstepping control scheme for morphing aircraft, integrating observer design with Critic-Actor reinforcement learning (RL) theory to achieve optimal control. The aerodynamic model of the morphing aircraft was first analyzed, leading to the formulation of a nonlinear dynamic model. An improved extended state observer (ESO) was designed to estimate lumped disturbances and imprecisely known system states, enhancing observation accuracy. A Critic-Actor RL network, combined with the ESO, was then implemented at each order of the backstepping design to generate optimal control commands, ensuring optimal performance while maintaining disturbance rejection. A lightweight RL framework was developed using an adaptive-law update method, enhancing practical applicability and distinguishing the approach from traditional RL methods by eliminating the need for repeated trial-and-error training. The boundedness of the closed-loop signals was proven theoretically via a Lyapunov function, ensuring the stability and safety of the controller. However, challenges remain in scenarios where the nonlinear dynamics of the aircraft are entirely unknown. Simulation results demonstrated that the proposed algorithm exhibits high robustness and stability in two simulation cases, with optimality guaranteed.
KW - Critic-actor framework
KW - Extended state observer (ESO)
KW - Morphing aircraft
KW - Optimal control
KW - Reinforcement learning (RL)
UR - http://www.scopus.com/inward/record.url?scp=105007290432&partnerID=8YFLogxK
U2 - 10.1016/j.ast.2025.110388
DO - 10.1016/j.ast.2025.110388
M3 - Article
AN - SCOPUS:105007290432
SN - 1270-9638
VL - 164
JO - Aerospace Science and Technology
JF - Aerospace Science and Technology
M1 - 110388
ER -