TY - JOUR
T1 - Deterministic Convergence Analysis and Application of Elman Neural Network via Sparse Mechanism and Entropy Error Function
AU - Kang, Qian
AU - Yu, Dengxiu
AU - Xu, Bowen
AU - Wang, Zhen
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In this study, we employ the batch gradient method to investigate the monotonicity and convergence of the Elman neural network (ENN) trained with an entropy error function (EEF) and regularization, which enhances network stability and sparsity while improving generalization. The traditional mean squared error (MSE) function in complex networks often leads to slow convergence during training, a tendency to become trapped in local minima, and even incorrect saturation. To address these drawbacks, we propose a novel EEF for training the ENN that effectively avoids the degradation of learning speed. Furthermore, by applying smoothing group L1/2 regularization (SGL1/2) to the ENN trained with the EEF, we overcome the error-function oscillations caused by traditional group L1/2 regularization (GL1/2). In addition, we optimize the network architecture in two ways: driving redundant nodes to near zero and pushing the redundant weights of the remaining nodes toward zero, further increasing network sparsity. This article rigorously proves the monotonicity of the error function and establishes both strong and weak convergence results for the proposed method. Experimental results illustrate the effectiveness and correctness of our approach, and the simulation results align with the theoretical findings.
AB - In this study, we employ the batch gradient method to investigate the monotonicity and convergence of the Elman neural network (ENN) trained with an entropy error function (EEF) and regularization, which enhances network stability and sparsity while improving generalization. The traditional mean squared error (MSE) function in complex networks often leads to slow convergence during training, a tendency to become trapped in local minima, and even incorrect saturation. To address these drawbacks, we propose a novel EEF for training the ENN that effectively avoids the degradation of learning speed. Furthermore, by applying smoothing group L1/2 regularization (SGL1/2) to the ENN trained with the EEF, we overcome the error-function oscillations caused by traditional group L1/2 regularization (GL1/2). In addition, we optimize the network architecture in two ways: driving redundant nodes to near zero and pushing the redundant weights of the remaining nodes toward zero, further increasing network sparsity. This article rigorously proves the monotonicity of the error function and establishes both strong and weak convergence results for the proposed method. Experimental results illustrate the effectiveness and correctness of our approach, and the simulation results align with the theoretical findings.
KW - Batch gradient method
KW - convergence analysis
KW - Elman neural network (ENN)
KW - entropy error function (EEF)
KW - regularization
UR - http://www.scopus.com/inward/record.url?scp=105006632939&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2025.3562223
DO - 10.1109/TNNLS.2025.3562223
M3 - Article
AN - SCOPUS:105006632939
SN - 2162-237X
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
M1 - 0b00006493f735e1
ER -