Reinforcement learning output feedback NN control using deterministic learning technique

Bin Xu; Chenguang Yang; Zhongke Shi

doi:10.1109/TNNLS.2013.2292704

Reinforcement learning output feedback NN control using deterministic learning technique

Bin Xu, Chenguang Yang, Zhongke Shi

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

246 引用（Scopus）

摘要

In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.

源语言	英语
文章编号	6681972
页（从-至）	635-641
页数	7
期刊	IEEE Transactions on Neural Networks and Learning Systems
卷	25
期	3
DOI	https://doi.org/10.1109/TNNLS.2013.2292704
出版状态	已出版 - 3月 2014

访问文件

10.1109/TNNLS.2013.2292704

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{0f306531a73b407bae1862cdc1a445c2,

title = "Reinforcement learning output feedback NN control using deterministic learning technique",

abstract = "In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.",

keywords = "Approximate dynamic programming, discrete-time system, output feedback control, pure-feedback system, radial basis function neural network (RBF NN)",

author = "Bin Xu and Chenguang Yang and Zhongke Shi",

year = "2014",

month = mar,

doi = "10.1109/TNNLS.2013.2292704",

language = "英语",

volume = "25",

pages = "635--641",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "3",

}

TY - JOUR

T1 - Reinforcement learning output feedback NN control using deterministic learning technique

AU - Xu, Bin

AU - Yang, Chenguang

AU - Shi, Zhongke

PY - 2014/3

Y1 - 2014/3

N2 - In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.

AB - In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.

KW - Approximate dynamic programming

KW - discrete-time system

KW - output feedback control

KW - pure-feedback system

KW - radial basis function neural network (RBF NN)

UR - http://www.scopus.com/inward/record.url?scp=84897663275&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2013.2292704

DO - 10.1109/TNNLS.2013.2292704

M3 - 文章

C2 - 24807456

AN - SCOPUS:84897663275

SN - 2162-237X

VL - 25

SP - 635

EP - 641

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 3

M1 - 6681972

ER -

Reinforcement learning output feedback NN control using deterministic learning technique

摘要

访问文件

其它文件与链接

指纹

引用此