Adaptive reinforcement learning control for a class of missiles with aerodynamic uncertainties and unmodeled dynamics

X. Ning; S. Cao; B. Han; Z. Wang; Y. Yin

doi:10.1017/aer.2023.36

Adaptive reinforcement learning control for a class of missiles with aerodynamic uncertainties and unmodeled dynamics

X. Ning, S. Cao, B. Han, Z. Wang, Y. Yin

School of Astronautics

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

In this paper, a super-twisting disturbance observer (STDO)-based adaptive reinforcement learning control scheme is proposed for the straight air compound missile system with aerodynamic uncertainties and unmodeled dynamics. Firstly, neural network (NN)-based adaptive reinforcement learning control scheme with actor-critic design is investigated to deal with the tracking problems for the straight gas compound system. The actor NN and the critic NN are utilised to cope with the unmodeled dynamics and approximate the cost function that are related to control input and tracking error, respectively. In other words, the actor NN is used to perform the tracking control behaviours, and the critic NN aims to evaluate the tracking performance and give feedback to actor NN. Moreover, with the aid of the STDO disturbance observer, the problem of the control signal fluctuation caused by the mismatched disturbance can be solved well. Based on the proposed adaptive law and the Lyapunov direct method, the eventually consistent boundedness of the straight gas compound system is proved. Finally, numerical simulations are carried out to demonstrate the feasibility and superiority of the proposed reinforcement learning-based STDO control algorithm.

Original language	English
Pages (from-to)	292-308
Number of pages	17
Journal	Aeronautical Journal
Volume	128
Issue number	1320
DOIs	https://doi.org/10.1017/aer.2023.36
State	Published - 1 Feb 2024

Keywords

reinforcement learning
straight air compound missile system
super-twisting disturbance observer
unmodeled dynamics

Access to Document

10.1017/aer.2023.36

Cite this

@article{9b2d1fb09c7046d6a2656a30e9e05efd,

title = "Adaptive reinforcement learning control for a class of missiles with aerodynamic uncertainties and unmodeled dynamics",

abstract = "In this paper, a super-twisting disturbance observer (STDO)-based adaptive reinforcement learning control scheme is proposed for the straight air compound missile system with aerodynamic uncertainties and unmodeled dynamics. Firstly, neural network (NN)-based adaptive reinforcement learning control scheme with actor-critic design is investigated to deal with the tracking problems for the straight gas compound system. The actor NN and the critic NN are utilised to cope with the unmodeled dynamics and approximate the cost function that are related to control input and tracking error, respectively. In other words, the actor NN is used to perform the tracking control behaviours, and the critic NN aims to evaluate the tracking performance and give feedback to actor NN. Moreover, with the aid of the STDO disturbance observer, the problem of the control signal fluctuation caused by the mismatched disturbance can be solved well. Based on the proposed adaptive law and the Lyapunov direct method, the eventually consistent boundedness of the straight gas compound system is proved. Finally, numerical simulations are carried out to demonstrate the feasibility and superiority of the proposed reinforcement learning-based STDO control algorithm.",

keywords = "reinforcement learning, straight air compound missile system, super-twisting disturbance observer, unmodeled dynamics",

author = "X. Ning and S. Cao and B. Han and Z. Wang and Y. Yin",

note = "Publisher Copyright: {\textcopyright} The Author(s), 2023. Published by Cambridge University Press on behalf of Royal Aeronautical Society.",

year = "2024",

month = feb,

day = "1",

doi = "10.1017/aer.2023.36",

language = "英语",

volume = "128",

pages = "292--308",

journal = "Aeronautical Journal",

issn = "0001-9240",

publisher = "Cambridge University Press",

number = "1320",

}

TY - JOUR

T1 - Adaptive reinforcement learning control for a class of missiles with aerodynamic uncertainties and unmodeled dynamics

AU - Ning, X.

AU - Cao, S.

AU - Han, B.

AU - Wang, Z.

AU - Yin, Y.

N1 - Publisher Copyright: © The Author(s), 2023. Published by Cambridge University Press on behalf of Royal Aeronautical Society.

PY - 2024/2/1

Y1 - 2024/2/1

N2 - In this paper, a super-twisting disturbance observer (STDO)-based adaptive reinforcement learning control scheme is proposed for the straight air compound missile system with aerodynamic uncertainties and unmodeled dynamics. Firstly, neural network (NN)-based adaptive reinforcement learning control scheme with actor-critic design is investigated to deal with the tracking problems for the straight gas compound system. The actor NN and the critic NN are utilised to cope with the unmodeled dynamics and approximate the cost function that are related to control input and tracking error, respectively. In other words, the actor NN is used to perform the tracking control behaviours, and the critic NN aims to evaluate the tracking performance and give feedback to actor NN. Moreover, with the aid of the STDO disturbance observer, the problem of the control signal fluctuation caused by the mismatched disturbance can be solved well. Based on the proposed adaptive law and the Lyapunov direct method, the eventually consistent boundedness of the straight gas compound system is proved. Finally, numerical simulations are carried out to demonstrate the feasibility and superiority of the proposed reinforcement learning-based STDO control algorithm.

AB - In this paper, a super-twisting disturbance observer (STDO)-based adaptive reinforcement learning control scheme is proposed for the straight air compound missile system with aerodynamic uncertainties and unmodeled dynamics. Firstly, neural network (NN)-based adaptive reinforcement learning control scheme with actor-critic design is investigated to deal with the tracking problems for the straight gas compound system. The actor NN and the critic NN are utilised to cope with the unmodeled dynamics and approximate the cost function that are related to control input and tracking error, respectively. In other words, the actor NN is used to perform the tracking control behaviours, and the critic NN aims to evaluate the tracking performance and give feedback to actor NN. Moreover, with the aid of the STDO disturbance observer, the problem of the control signal fluctuation caused by the mismatched disturbance can be solved well. Based on the proposed adaptive law and the Lyapunov direct method, the eventually consistent boundedness of the straight gas compound system is proved. Finally, numerical simulations are carried out to demonstrate the feasibility and superiority of the proposed reinforcement learning-based STDO control algorithm.

KW - reinforcement learning

KW - straight air compound missile system

KW - super-twisting disturbance observer

KW - unmodeled dynamics

UR - http://www.scopus.com/inward/record.url?scp=85164819735&partnerID=8YFLogxK

U2 - 10.1017/aer.2023.36

DO - 10.1017/aer.2023.36

M3 - 文章

AN - SCOPUS:85164819735

SN - 0001-9240

VL - 128

SP - 292

EP - 308

JO - Aeronautical Journal

JF - Aeronautical Journal

IS - 1320

ER -

Adaptive reinforcement learning control for a class of missiles with aerodynamic uncertainties and unmodeled dynamics

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this