TY - JOUR
T1 - Training restricted Boltzmann machine using gradient fixing based algorithm
AU - Li, Fei
AU - Gao, Xiaoguang
AU - Wan, Kaifang
N1 - Publisher Copyright:
© 2018 Chinese Institute of Electronics. All rights reserved.
PY - 2018/7/10
Y1 - 2018/7/10
N2 - Most algorithms for training restricted Boltzmann machines (RBM) are based on Gibbs sampling. When a sampling algorithm is used to calculate the gradient, the sampled gradient is only an approximation of the true gradient, and the large error between them seriously degrades network training. To address this problem, this paper analyses the numerical error and the orientation error between the approximate gradient and the true gradient, and examines their influence on training performance. A gradient fixing model is established to adjust the numerical value and the orientation of the approximate gradient and thereby reduce the error. We also design a gradient fixing based Gibbs sampling training algorithm (GFGS) and a gradient fixing based parallel tempering algorithm (GFPT), and compare the novel algorithms with existing ones experimentally. The results demonstrate that the new algorithms effectively tackle the gradient error and achieve higher training accuracy at a reasonable cost in computational runtime.
AB - Most algorithms for training restricted Boltzmann machines (RBM) are based on Gibbs sampling. When a sampling algorithm is used to calculate the gradient, the sampled gradient is only an approximation of the true gradient, and the large error between them seriously degrades network training. To address this problem, this paper analyses the numerical error and the orientation error between the approximate gradient and the true gradient, and examines their influence on training performance. A gradient fixing model is established to adjust the numerical value and the orientation of the approximate gradient and thereby reduce the error. We also design a gradient fixing based Gibbs sampling training algorithm (GFGS) and a gradient fixing based parallel tempering algorithm (GFPT), and compare the novel algorithms with existing ones experimentally. The results demonstrate that the new algorithms effectively tackle the gradient error and achieve higher training accuracy at a reasonable cost in computational runtime.
KW - Deep Learning
KW - Gradient fixing based Gibbs sampling training algorithm (GFGS)
KW - Gradient fixing
KW - Gradient fixing based parallel tempering algorithm (GFPT)
KW - Restricted Boltzmann machine (RBM)
UR - http://www.scopus.com/inward/record.url?scp=85051354897&partnerID=8YFLogxK
U2 - 10.1049/cje.2018.05.007
DO - 10.1049/cje.2018.05.007
M3 - Article
AN - SCOPUS:85051354897
SN - 1022-4653
VL - 27
SP - 694
EP - 703
JO - Chinese Journal of Electronics
JF - Chinese Journal of Electronics
IS - 4
ER -