TY - GEN
T1 - A training algorithm and stability analysis for recurrent neural networks
AU - Xu, Zhao
AU - Song, Qing
AU - Wang, Danwei
AU - Fan, Haijin
PY - 2012
Y1 - 2012
N2 - Training recurrent neural networks (RNNs) involves considerable computational complexity due to the need for gradient evaluations. Achieving fast convergence and low computational complexity remains a challenging, open problem. Moreover, the transient response of the RNN learning process is a critical issue, especially for online applications. Conventional RNN training algorithms, such as backpropagation through time (BPTT) and real-time recurrent learning (RTRL), do not adequately satisfy these requirements because they often converge slowly; if a large learning rate is chosen to speed up training, the process may become unstable and the weights may diverge. In this paper, a novel RNN training algorithm, named robust recurrent simultaneous perturbation stochastic approximation (RRSPSA), is developed with a specially designed recurrent hybrid adaptive parameter and adaptive learning rates. RRSPSA is a powerful twin-engine training algorithm of the simultaneous perturbation stochastic approximation (SPSA) type. It uses three specially designed adaptive parameters to maximize the training speed of the recurrent training signal while retaining weight convergence properties, and, like the original SPSA algorithm, it requires only two objective function measurements per iteration. Weight convergence and system stability of RRSPSA are proved in the sense of a Lyapunov function. Computer simulations were carried out to demonstrate the applicability of the theoretical results.
KW - recurrent neural networks (RNNs)
KW - simultaneous perturbation stochastic approximation (SPSA) training
KW - weight convergence and stability proofs
UR - http://www.scopus.com/inward/record.url?scp=84867632637&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84867632637
SN - 9780982443859
T3 - 15th International Conference on Information Fusion, FUSION 2012
SP - 2285
EP - 2292
BT - 15th International Conference on Information Fusion, FUSION 2012
T2 - 15th International Conference on Information Fusion, FUSION 2012
Y2 - 7 September 2012 through 12 September 2012
ER -
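
Note: the abstract highlights that SPSA-type training needs only two objective function measurements per iteration, regardless of the number of weights. A minimal sketch of the standard SPSA update (Spall's algorithm) follows for reference; it illustrates the two-measurement gradient estimate only and does not implement the paper's RRSPSA variant, whose recurrent hybrid adaptive parameter and adaptive learning rates are specific to this work.

import numpy as np

def spsa_step(loss, theta, a_k, c_k, rng):
    """One iteration of standard SPSA: the gradient is estimated
    from only two loss evaluations, regardless of dimension."""
    # Rademacher (+/-1) simultaneous perturbation direction
    delta = rng.choice([-1.0, 1.0], size=theta.shape)
    # The two objective function measurements
    y_plus = loss(theta + c_k * delta)
    y_minus = loss(theta - c_k * delta)
    # Simultaneous-perturbation gradient estimate (componentwise)
    g_hat = (y_plus - y_minus) / (2.0 * c_k * delta)
    # Gradient-descent-style weight update
    return theta - a_k * g_hat

# Usage sketch: minimize a simple quadratic objective
rng = np.random.default_rng(0)
theta = np.array([2.0, -3.0])
loss = lambda w: float(np.sum(w ** 2))
for k in range(1, 201):
    # Standard decaying gain schedules with Spall's recommended exponents
    a_k, c_k = 0.1 / k ** 0.602, 0.1 / k ** 0.101
    theta = spsa_step(loss, theta, a_k, c_k, rng)
print(theta)  # approaches [0, 0]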