Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator

Xinxin Guo; Weisheng Yan; Rongxin Cui

doi:10.1109/TNNLS.2019.2957287

Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator

Xinxin Guo, Weisheng Yan, Rongxin Cui

School of Marine Science and Technology

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

In this article, a synchronous reinforcement-learning-based algorithm is developed for input-constrained partially unknown systems. The proposed control also alleviates the need for an initial stabilizing control. A first-order robust exact differentiator is employed to approximate unknown drift dynamics. Critic, actor, and disturbance neural networks (NNs) are established to approximate the value function, the control policy, and the disturbance policy, respectively. The Hamilton-Jacobi-Isaacs equation is solved by applying the value function approximation technique. The stability of the closed-loop system can be ensured. The state and weight errors of the three NNs are all uniformly ultimately bounded. Finally, the simulation results are provided to verify the effectiveness of the proposed method.

Original language	English
Article number	8943132
Pages (from-to)	4713-4725
Number of pages	13
Journal	IEEE Transactions on Neural Networks and Learning Systems
Volume	31
Issue number	11
DOIs	https://doi.org/10.1109/TNNLS.2019.2957287
State	Published - Nov 2020

Keywords

First-order robust exact differentiator (RED)
input constraint
neural network (NN)
reinforcement learning (RL)
two-player zero-sum game

Access to Document

10.1109/TNNLS.2019.2957287

Cite this

@article{38e86f8b395848f3bd2576308ab9fc60,

title = "Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator",

abstract = "In this article, a synchronous reinforcement-learning-based algorithm is developed for input-constrained partially unknown systems. The proposed control also alleviates the need for an initial stabilizing control. A first-order robust exact differentiator is employed to approximate unknown drift dynamics. Critic, actor, and disturbance neural networks (NNs) are established to approximate the value function, the control policy, and the disturbance policy, respectively. The Hamilton-Jacobi-Isaacs equation is solved by applying the value function approximation technique. The stability of the closed-loop system can be ensured. The state and weight errors of the three NNs are all uniformly ultimately bounded. Finally, the simulation results are provided to verify the effectiveness of the proposed method.",

keywords = "First-order robust exact differentiator (RED), input constraint, neural network (NN), reinforcement learning (RL), two-player zero-sum game",

author = "Xinxin Guo and Weisheng Yan and Rongxin Cui",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2020",

month = nov,

doi = "10.1109/TNNLS.2019.2957287",

language = "英语",

volume = "31",

pages = "4713--4725",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "11",

}

TY - JOUR

T1 - Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator

AU - Guo, Xinxin

AU - Yan, Weisheng

AU - Cui, Rongxin

PY - 2020/11

Y1 - 2020/11

N2 - In this article, a synchronous reinforcement-learning-based algorithm is developed for input-constrained partially unknown systems. The proposed control also alleviates the need for an initial stabilizing control. A first-order robust exact differentiator is employed to approximate unknown drift dynamics. Critic, actor, and disturbance neural networks (NNs) are established to approximate the value function, the control policy, and the disturbance policy, respectively. The Hamilton-Jacobi-Isaacs equation is solved by applying the value function approximation technique. The stability of the closed-loop system can be ensured. The state and weight errors of the three NNs are all uniformly ultimately bounded. Finally, the simulation results are provided to verify the effectiveness of the proposed method.

AB - In this article, a synchronous reinforcement-learning-based algorithm is developed for input-constrained partially unknown systems. The proposed control also alleviates the need for an initial stabilizing control. A first-order robust exact differentiator is employed to approximate unknown drift dynamics. Critic, actor, and disturbance neural networks (NNs) are established to approximate the value function, the control policy, and the disturbance policy, respectively. The Hamilton-Jacobi-Isaacs equation is solved by applying the value function approximation technique. The stability of the closed-loop system can be ensured. The state and weight errors of the three NNs are all uniformly ultimately bounded. Finally, the simulation results are provided to verify the effectiveness of the proposed method.

KW - First-order robust exact differentiator (RED)

KW - input constraint

KW - neural network (NN)

KW - reinforcement learning (RL)

KW - two-player zero-sum game

UR - http://www.scopus.com/inward/record.url?scp=85077270909&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2019.2957287

DO - 10.1109/TNNLS.2019.2957287

M3 - 文章

C2 - 31880567

AN - SCOPUS:85077270909

SN - 2162-237X

VL - 31

SP - 4713

EP - 4725

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 11

M1 - 8943132

ER -

Reinforcement Learning-Based Nearly Optimal Control for Constrained-Input Partially Unknown Systems Using Differentiator

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this