Reinforcement Learning based Optimal Tracking Control for Hypersonic Flight Vehicle: A Model Free Approach

Xiaoxiang Hu, Kejun Dong, Teng Yang, Bing Xiao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The tracking control of hypersonic flight vehicle (HFV) is discussed in this paper, and the nonlinear model of HFV is assumed to be completely unknown. This problem is surely challenging because of the missing prior knowledge, but is more closer to reality since the exact mode of HFV is difficult to be obtained. A reinforcement learning (RL) based optimal controller is proposed for the tracking control of HFV. A model based RL algorithm is firstly proposed and then, based on this algorithm, a model free algorithm is constructed. For relaxing the environmental conditions, neural network (NN) is adopted for the approximation of Critic and Actor, and then a Greedy Policy based updated learning law for NN is derived. The presented RL based control strategy is carried on the nonlinear model of HFV to show its effectiveness.

Original languageEnglish
Title of host publication2022 IEEE 20th International Conference on Industrial Informatics, INDIN 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages711-717
Number of pages7
ISBN (Electronic)9781728175683
DOIs
StatePublished - 2022
Event20th IEEE International Conference on Industrial Informatics, INDIN 2022 - Perth, Australia
Duration: 25 Jul 202228 Jul 2022

Publication series

NameIEEE International Conference on Industrial Informatics (INDIN)
Volume2022-July
ISSN (Print)1935-4576

Conference

Conference20th IEEE International Conference on Industrial Informatics, INDIN 2022
Country/TerritoryAustralia
CityPerth
Period25/07/2228/07/22

Keywords

  • Hypersonic flight vehicles (HFV)
  • Model free
  • reinforcement learning(RL)

Fingerprint

Dive into the research topics of 'Reinforcement Learning based Optimal Tracking Control for Hypersonic Flight Vehicle: A Model Free Approach'. Together they form a unique fingerprint.

Cite this