TY - JOUR
T1 - Adaptive Dynamic Programming Approach on Optimal Control for Affinely Pseudo- Linearized Nonlinear System
AU - Zhang, Lin
AU - Zhu, Yian
AU - Shi, Haobin
AU - Hwang, Kao Shing
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2019
Y1 - 2019
N2 - This paper concerns the optimal control problem of a general affinely pseudo-linearized nonlinear system and puts forward the solving scheme based on adaptive dynamic programming and neural network. In detail, a new more general affinely pseudo-linearized nonlinear system model is presented and covers more practical scenarios than existing models in engineering. In consideration of the Bellman optimal principle, the corresponding model particulars are then analyzed and its adaptive dynamic programming solving scheme is derived. That is, the modified policy iteration is used to estimate the value function and optimal control at each sampled state, and then the calculated state-control pairs and state critic values are utilized to train two neural networks (one works as state evaluator and the other works as optimal control generator, respectively). Through the above, the discrete state space is smoothed into a continuous state space, and the notorious dimensional curse problem could be addressed. Moreover, in order to improve the efficiency and precision of the network training, the samples calculating and network training processes are simultaneously carried out in this paper. In the end, a related simulation experiment is applied and the results demonstrate that the proposed method has more effectiveness and validation for the optimal control problem of the newly proposed general system model than the compared methods.
AB - This paper concerns the optimal control problem of a general affinely pseudo-linearized nonlinear system and puts forward the solving scheme based on adaptive dynamic programming and neural network. In detail, a new more general affinely pseudo-linearized nonlinear system model is presented and covers more practical scenarios than existing models in engineering. In consideration of the Bellman optimal principle, the corresponding model particulars are then analyzed and its adaptive dynamic programming solving scheme is derived. That is, the modified policy iteration is used to estimate the value function and optimal control at each sampled state, and then the calculated state-control pairs and state critic values are utilized to train two neural networks (one works as state evaluator and the other works as optimal control generator, respectively). Through the above, the discrete state space is smoothed into a continuous state space, and the notorious dimensional curse problem could be addressed. Moreover, in order to improve the efficiency and precision of the network training, the samples calculating and network training processes are simultaneously carried out in this paper. In the end, a related simulation experiment is applied and the results demonstrate that the proposed method has more effectiveness and validation for the optimal control problem of the newly proposed general system model than the compared methods.
KW - adaptive dynamic programming
KW - neural network
KW - Optimal control
KW - system identification
UR - http://www.scopus.com/inward/record.url?scp=85068330716&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2019.2920867
DO - 10.1109/ACCESS.2019.2920867
M3 - 文章
AN - SCOPUS:85068330716
SN - 2169-3536
VL - 7
SP - 75132
EP - 75142
JO - IEEE Access
JF - IEEE Access
M1 - 8731993
ER -