Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm

Fei Li; Xiao Guang Gao; Kai Fang Wan

doi:10.16383/j.aas.2017.c160326

Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm

Fei Li, Xiao Guang Gao, Kai Fang Wan

电子信息学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

4 引用（Scopus）

摘要

Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.

源语言	英语
页（从-至）	753-764
页数	12
期刊	Zidonghua Xuebao/Acta Automatica Sinica
卷	43
期	5
DOI	https://doi.org/10.16383/j.aas.2017.c160326
出版状态	已出版 - 5月 2017

访问文件

10.16383/j.aas.2017.c160326

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{ebb2b802f61241528a6f8e74a75c75c9,

title = "Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm",

abstract = "Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.",

keywords = "Deep learning, GFPT (Gradient fixing parallel tempering), Markov theory, Parallel tempering, Restricted Boltzmann machine (RBM), Sampling algorithm",

author = "Fei Li and Gao, {Xiao Guang} and Wan, {Kai Fang}",

year = "2017",

month = may,

doi = "10.16383/j.aas.2017.c160326",

language = "英语",

volume = "43",

pages = "753--764",

journal = "Zidonghua Xuebao/Acta Automatica Sinica",

issn = "0254-4156",

publisher = "Science Press ",

number = "5",

}

TY - JOUR

T1 - Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm

AU - Li, Fei

AU - Gao, Xiao Guang

AU - Wan, Kai Fang

PY - 2017/5

Y1 - 2017/5

N2 - Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.

AB - Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.

KW - Deep learning

KW - GFPT (Gradient fixing parallel tempering)

KW - Markov theory

KW - Parallel tempering

KW - Restricted Boltzmann machine (RBM)

KW - Sampling algorithm

UR - http://www.scopus.com/inward/record.url?scp=85021860612&partnerID=8YFLogxK

U2 - 10.16383/j.aas.2017.c160326

DO - 10.16383/j.aas.2017.c160326

M3 - 文章

AN - SCOPUS:85021860612

SN - 0254-4156

VL - 43

SP - 753

EP - 764

JO - Zidonghua Xuebao/Acta Automatica Sinica

JF - Zidonghua Xuebao/Acta Automatica Sinica

IS - 5

ER -

Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm

摘要

访问文件

其它文件与链接

指纹

引用此