Training restricted Boltzmann machine using gradient fixing based algorithm

Fei Li, Xiaoguang Gao, Kaifang Wan

Research output: Contribution to journal › Article › peer-review

7 Scopus citations

Abstract

Most algorithms for training restricted Boltzmann machines (RBM) are based on Gibbs sampling. When a sampling algorithm is used to calculate the gradient, the sampled gradient is only an approximation of the true gradient, and the large error between the two seriously degrades network training. To address this problem, this paper analyses the numerical error and orientation error between the approximate gradient and the true gradient, and examines their influence on training performance. A gradient fixing model is then established to adjust the numerical value and orientation of the approximate gradient and thereby reduce the error. On this basis, we design a gradient fixing based Gibbs sampling training algorithm (GFGS) and a gradient fixing based parallel tempering algorithm (GFPT), and compare the novel algorithms with existing ones experimentally. The results demonstrate that the new algorithms effectively tackle the gradient-error issue and achieve higher training accuracy at a reasonable expense of computational runtime.
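To make the gradient error concrete: the standard Gibbs-sampling-based estimator the abstract refers to is contrastive divergence. The sketch below (a minimal NumPy illustration, not the paper's GFGS/GFPT algorithms, whose fixing model is not detailed here) shows a CD-1 gradient estimate for a binary RBM; because the Gibbs chain is truncated after one step, the returned gradient deviates from the true log-likelihood gradient, which is the error the gradient fixing model targets. All function and variable names are illustrative assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_gradient(v0, W, b, c, rng):
    """One-step contrastive divergence (CD-1) estimate of the RBM
    log-likelihood gradient for a batch of binary visible vectors v0.
    The Gibbs chain is truncated after a single step, so the result
    is only an approximation of the true gradient."""
    # Positive phase: hidden unit probabilities given the data.
    ph0 = sigmoid(v0 @ W + c)
    h0 = (rng.random(ph0.shape) < ph0).astype(float)
    # Negative phase: one Gibbs step back to visible, then to hidden.
    pv1 = sigmoid(h0 @ W.T + b)
    v1 = (rng.random(pv1.shape) < pv1).astype(float)
    ph1 = sigmoid(v1 @ W + c)
    # Approximate gradients: data statistics minus model statistics.
    dW = v0.T @ ph0 - v1.T @ ph1
    db = (v0 - v1).sum(axis=0)
    dc = (ph0 - ph1).sum(axis=0)
    n = len(v0)
    return dW / n, db / n, dc / n

# Example usage on random data.
rng = np.random.default_rng(0)
W = rng.normal(0.0, 0.1, size=(6, 4))   # visible x hidden weights
b = np.zeros(6)                          # visible biases
c = np.zeros(4)                          # hidden biases
v0 = (rng.random((8, 6)) < 0.5).astype(float)  # batch of 8 samples
dW, db, dc = cd1_gradient(v0, W, b, c, rng)
```

Running a longer chain (CD-k) or parallel tempering reduces, but does not eliminate, the numerical and orientation error of this estimator relative to the true gradient.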

Original language: English
Pages (from-to): 694-703
Number of pages: 10
Journal: Chinese Journal of Electronics
Volume: 27
Issue number: 4
State: Published - 10 Jul 2018

Keywords

  • Deep Learning
  • Gradient fixing based Gibbs sampling training algorithm (GFGS)
  • Gradient fixing
  • Gradient fixing based parallel tempering algorithm (GFPT)
  • Restricted Boltzmann machine (RBM)
