TY - JOUR
T1 - Reinforcement learning output feedback NN control using deterministic learning technique
AU - Xu, Bin
AU - Yang, Chenguang
AU - Shi, Zhongke
PY - 2014/3
Y1 - 2014/3
N2 - In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.
AB - In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic NN control architecture includes two NNs, whereas the critic NN is used to approximate the strategic utility function, and the action NN is employed to minimize both the strategic utility function and the tracking error. A deterministic learning technique has been employed to guarantee that the partial persistent excitation condition of internal states is satisfied during tracking control to a periodic reference orbit. The uniformly ultimate boundedness of closed-loop signals is shown via Lyapunov stability analysis. Simulation results are presented to demonstrate the effectiveness of the proposed control.
KW - Approximate dynamic programming
KW - discrete-time system
KW - output feedback control
KW - pure-feedback system
KW - radial basis function neural network (RBF NN)
UR - http://www.scopus.com/inward/record.url?scp=84897663275&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2013.2292704
DO - 10.1109/TNNLS.2013.2292704
M3 - 文章
C2 - 24807456
AN - SCOPUS:84897663275
SN - 2162-237X
VL - 25
SP - 635
EP - 641
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 3
M1 - 6681972
ER -