Research on online reinforcement learning method based on experience-replay

Ning Hu; Zhijun Ge; Xuanwen Chen; Chunguang Ding; Haobin Shi

doi:10.1109/ICInfA.2018.8812454

Research on online reinforcement learning method based on experience-replay

Ning Hu, Zhijun Ge, Xuanwen Chen, Chunguang Ding, Haobin Shi

School of Computer Science

China Electronic Product Reliability and Environmental Testing Research Institute

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

As for standard reinforcement learning, the key is that the agent's next step is directed by the instantaneous and delayed reporting from constant interaction with the environment and trial and error learning. But it makes the convergence rate slower for actual reinforcement learning; at the same time, inconsistency state will occur in the agent learning process. Therefore, it is necessary for the agent to remember what has been learned within the time specified to improve the convergence and robustness of decision making. With regard to the above-mentioned issues, this paper proposes to accelerate the convergence rate of reinforcement learning by using the function approximation ability of neural network and to improve the robustness of reinforcement learning by using the Memory-based Experience-Replay(ER) algorithm. The experimental results show the effectiveness of the proposed method.

Original language	English
Title of host publication	2018 IEEE International Conference on Information and Automation, ICIA 2018
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1338-1343
Number of pages	6
ISBN (Electronic)	9781538680698
DOIs	https://doi.org/10.1109/ICInfA.2018.8812454
State	Published - Aug 2018
Event	2018 IEEE International Conference on Information and Automation, ICIA 2018 - Wuyishan, Fujian, China Duration: 11 Aug 2018 → 13 Aug 2018

Publication series

Name	2018 IEEE International Conference on Information and Automation, ICIA 2018

Conference

Conference	2018 IEEE International Conference on Information and Automation, ICIA 2018
Country/Territory	China
City	Wuyishan, Fujian
Period	11/08/18 → 13/08/18

Keywords

Experience-Replay
Neural Network
Reinforcement Learning

Access to Document

10.1109/ICInfA.2018.8812454

Cite this

Hu, N., Ge, Z., Chen, X., Ding, C., & Shi, H. (2018). Research on online reinforcement learning method based on experience-replay. In 2018 IEEE International Conference on Information and Automation, ICIA 2018 (pp. 1338-1343). Article 8812454 (2018 IEEE International Conference on Information and Automation, ICIA 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICInfA.2018.8812454

@inproceedings{42114767130348baa703382829eb7bfb,

title = "Research on online reinforcement learning method based on experience-replay",

abstract = "As for standard reinforcement learning, the key is that the agent's next step is directed by the instantaneous and delayed reporting from constant interaction with the environment and trial and error learning. But it makes the convergence rate slower for actual reinforcement learning; at the same time, inconsistency state will occur in the agent learning process. Therefore, it is necessary for the agent to remember what has been learned within the time specified to improve the convergence and robustness of decision making. With regard to the above-mentioned issues, this paper proposes to accelerate the convergence rate of reinforcement learning by using the function approximation ability of neural network and to improve the robustness of reinforcement learning by using the Memory-based Experience-Replay(ER) algorithm. The experimental results show the effectiveness of the proposed method.",

keywords = "Experience-Replay, Neural Network, Reinforcement Learning",

author = "Ning Hu and Zhijun Ge and Xuanwen Chen and Chunguang Ding and Haobin Shi",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 2018 IEEE International Conference on Information and Automation, ICIA 2018 ; Conference date: 11-08-2018 Through 13-08-2018",

year = "2018",

month = aug,

doi = "10.1109/ICInfA.2018.8812454",

language = "英语",

series = "2018 IEEE International Conference on Information and Automation, ICIA 2018",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1338--1343",

booktitle = "2018 IEEE International Conference on Information and Automation, ICIA 2018",

}

Hu, N, Ge, Z, Chen, X, Ding, C & Shi, H 2018, Research on online reinforcement learning method based on experience-replay. in 2018 IEEE International Conference on Information and Automation, ICIA 2018., 8812454, 2018 IEEE International Conference on Information and Automation, ICIA 2018, Institute of Electrical and Electronics Engineers Inc., pp. 1338-1343, 2018 IEEE International Conference on Information and Automation, ICIA 2018, Wuyishan, Fujian, China, 11/08/18. https://doi.org/10.1109/ICInfA.2018.8812454

Research on online reinforcement learning method based on experience-replay. / Hu, Ning; Ge, Zhijun; Chen, Xuanwen et al.
2018 IEEE International Conference on Information and Automation, ICIA 2018. Institute of Electrical and Electronics Engineers Inc., 2018. p. 1338-1343 8812454 (2018 IEEE International Conference on Information and Automation, ICIA 2018).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Research on online reinforcement learning method based on experience-replay

AU - Hu, Ning

AU - Ge, Zhijun

AU - Chen, Xuanwen

AU - Ding, Chunguang

AU - Shi, Haobin

PY - 2018/8

Y1 - 2018/8

N2 - As for standard reinforcement learning, the key is that the agent's next step is directed by the instantaneous and delayed reporting from constant interaction with the environment and trial and error learning. But it makes the convergence rate slower for actual reinforcement learning; at the same time, inconsistency state will occur in the agent learning process. Therefore, it is necessary for the agent to remember what has been learned within the time specified to improve the convergence and robustness of decision making. With regard to the above-mentioned issues, this paper proposes to accelerate the convergence rate of reinforcement learning by using the function approximation ability of neural network and to improve the robustness of reinforcement learning by using the Memory-based Experience-Replay(ER) algorithm. The experimental results show the effectiveness of the proposed method.

AB - As for standard reinforcement learning, the key is that the agent's next step is directed by the instantaneous and delayed reporting from constant interaction with the environment and trial and error learning. But it makes the convergence rate slower for actual reinforcement learning; at the same time, inconsistency state will occur in the agent learning process. Therefore, it is necessary for the agent to remember what has been learned within the time specified to improve the convergence and robustness of decision making. With regard to the above-mentioned issues, this paper proposes to accelerate the convergence rate of reinforcement learning by using the function approximation ability of neural network and to improve the robustness of reinforcement learning by using the Memory-based Experience-Replay(ER) algorithm. The experimental results show the effectiveness of the proposed method.

KW - Experience-Replay

KW - Neural Network

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=85072349253&partnerID=8YFLogxK

U2 - 10.1109/ICInfA.2018.8812454

DO - 10.1109/ICInfA.2018.8812454

M3 - 会议稿件

AN - SCOPUS:85072349253

T3 - 2018 IEEE International Conference on Information and Automation, ICIA 2018

SP - 1338

EP - 1343

BT - 2018 IEEE International Conference on Information and Automation, ICIA 2018

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2018 IEEE International Conference on Information and Automation, ICIA 2018

Y2 - 11 August 2018 through 13 August 2018

ER -

Hu N, Ge Z, Chen X, Ding C, Shi H. Research on online reinforcement learning method based on experience-replay. In 2018 IEEE International Conference on Information and Automation, ICIA 2018. Institute of Electrical and Electronics Engineers Inc. 2018. p. 1338-1343. 8812454. (2018 IEEE International Conference on Information and Automation, ICIA 2018). doi: 10.1109/ICInfA.2018.8812454

Research on online reinforcement learning method based on experience-replay

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this