TY - JOUR
T1 - Adaptive reinforcement learning control for a class of missiles with aerodynamic uncertainties and unmodeled dynamics
AU - Ning, X.
AU - Cao, S.
AU - Han, B.
AU - Wang, Z.
AU - Yin, Y.
N1 - Publisher Copyright:
© The Author(s), 2023. Published by Cambridge University Press on behalf of Royal Aeronautical Society.
PY - 2024/2/1
Y1 - 2024/2/1
N2 - In this paper, a super-twisting disturbance observer (STDO)-based adaptive reinforcement learning control scheme is proposed for the straight air compound missile system with aerodynamic uncertainties and unmodeled dynamics. Firstly, neural network (NN)-based adaptive reinforcement learning control scheme with actor-critic design is investigated to deal with the tracking problems for the straight gas compound system. The actor NN and the critic NN are utilised to cope with the unmodeled dynamics and approximate the cost function that are related to control input and tracking error, respectively. In other words, the actor NN is used to perform the tracking control behaviours, and the critic NN aims to evaluate the tracking performance and give feedback to actor NN. Moreover, with the aid of the STDO disturbance observer, the problem of the control signal fluctuation caused by the mismatched disturbance can be solved well. Based on the proposed adaptive law and the Lyapunov direct method, the eventually consistent boundedness of the straight gas compound system is proved. Finally, numerical simulations are carried out to demonstrate the feasibility and superiority of the proposed reinforcement learning-based STDO control algorithm.
AB - In this paper, a super-twisting disturbance observer (STDO)-based adaptive reinforcement learning control scheme is proposed for the straight air compound missile system with aerodynamic uncertainties and unmodeled dynamics. Firstly, neural network (NN)-based adaptive reinforcement learning control scheme with actor-critic design is investigated to deal with the tracking problems for the straight gas compound system. The actor NN and the critic NN are utilised to cope with the unmodeled dynamics and approximate the cost function that are related to control input and tracking error, respectively. In other words, the actor NN is used to perform the tracking control behaviours, and the critic NN aims to evaluate the tracking performance and give feedback to actor NN. Moreover, with the aid of the STDO disturbance observer, the problem of the control signal fluctuation caused by the mismatched disturbance can be solved well. Based on the proposed adaptive law and the Lyapunov direct method, the eventually consistent boundedness of the straight gas compound system is proved. Finally, numerical simulations are carried out to demonstrate the feasibility and superiority of the proposed reinforcement learning-based STDO control algorithm.
KW - reinforcement learning
KW - straight air compound missile system
KW - super-twisting disturbance observer
KW - unmodeled dynamics
UR - http://www.scopus.com/inward/record.url?scp=85164819735&partnerID=8YFLogxK
U2 - 10.1017/aer.2023.36
DO - 10.1017/aer.2023.36
M3 - 文章
AN - SCOPUS:85164819735
SN - 0001-9240
VL - 128
SP - 292
EP - 308
JO - Aeronautical Journal
JF - Aeronautical Journal
IS - 1320
ER -