TY - JOUR
T1 - Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm
AU - Li, Fei
AU - Gao, Xiao Guang
AU - Wan, Kai Fang
N1 - Publisher Copyright:
Copyright © 2017 Acta Automatica Sinica. All rights reserved.
PY - 2017/5
Y1 - 2017/5
N2 - Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.
AB - Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.
KW - Deep learning
KW - GFPT (Gradient fixing parallel tempering)
KW - Markov theory
KW - Parallel tempering
KW - Restricted Boltzmann machine (RBM)
KW - Sampling algorithm
UR - http://www.scopus.com/inward/record.url?scp=85021860612&partnerID=8YFLogxK
U2 - 10.16383/j.aas.2017.c160326
DO - 10.16383/j.aas.2017.c160326
M3 - 文章
AN - SCOPUS:85021860612
SN - 0254-4156
VL - 43
SP - 753
EP - 764
JO - Zidonghua Xuebao/Acta Automatica Sinica
JF - Zidonghua Xuebao/Acta Automatica Sinica
IS - 5
ER -