TY - GEN
T1 - Improve PID controller through reinforcement learning
AU - Qin, Yunxiao
AU - Zhang, Weiguo
AU - Shi, Jingping
AU - Liu, Jinglong
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/8
Y1 - 2018/8
N2 - Deep Reinforcement Learning (DRL) algorithm provides an effective and powerful way for the computer learning to make decisions according to what the computer sees, the most famous application of DRL is training the computer to play computer games, such as the computer Go and Atari Games. In this paper, we present that the DRL can used to improve the classical PID controller by training an adaptive PID controller with its parameters change according to the changing state, we name our new controller as DRPID, the deep reinforcement learning algorithm that we used to train the PID controller in this paper is deep deterministic policy gradient, which is very suitable to solve continue action control problem. We test our DRPID algorithm by training it to control an inverted pendulum in the OpenAI gym simulation environment, and we found DRPID controller works very well, outperforms the common PID controller with fixed parameters, by a great margin.
AB - Deep Reinforcement Learning (DRL) algorithm provides an effective and powerful way for the computer learning to make decisions according to what the computer sees, the most famous application of DRL is training the computer to play computer games, such as the computer Go and Atari Games. In this paper, we present that the DRL can used to improve the classical PID controller by training an adaptive PID controller with its parameters change according to the changing state, we name our new controller as DRPID, the deep reinforcement learning algorithm that we used to train the PID controller in this paper is deep deterministic policy gradient, which is very suitable to solve continue action control problem. We test our DRPID algorithm by training it to control an inverted pendulum in the OpenAI gym simulation environment, and we found DRPID controller works very well, outperforms the common PID controller with fixed parameters, by a great margin.
UR - http://www.scopus.com/inward/record.url?scp=85082449279&partnerID=8YFLogxK
U2 - 10.1109/GNCC42960.2018.9019095
DO - 10.1109/GNCC42960.2018.9019095
M3 - 会议稿件
AN - SCOPUS:85082449279
T3 - 2018 IEEE CSAA Guidance, Navigation and Control Conference, CGNCC 2018
BT - 2018 IEEE CSAA Guidance, Navigation and Control Conference, CGNCC 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE CSAA Guidance, Navigation and Control Conference, CGNCC 2018
Y2 - 10 August 2018 through 12 August 2018
ER -