A systematic DNN weight pruning framework based on symmetric accelerated stochastic ADMM

Ming Yuan, Jianchao Bai, Feng Jiang, Lin Du

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Weight pruning is widely employed to compress Deep Neural Networks (DNNs) because of their increasing computation and storage requirements. However, related work has failed to efficiently combine the structure of the DNN loss function with the Alternating Direction Method of Multipliers (ADMM). This paper presents a systematic DNN weight pruning framework using the advanced symmetric accelerated stochastic ADMM (SAS-ADMM). Specifically, the weight pruning problem is formulated as an optimization problem consisting of the DNN loss function and an L1 regularization term. SAS-ADMM solves this problem by dividing it into two lower-dimensional and relatively easier subproblems. In addition, an optimizer based on SAS-ADMM is presented to ensure that the pruned DNNs converge. Experimental results demonstrate that our method achieves a faster convergence rate at a better or similar weight pruning rate than previous work. On the CIFAR-10 data set, our method reduces the number of ResNet-32 and ResNet-56 parameters by factors of 6.61× and 9.93×, respectively, while maintaining accuracy. In similar experiments with AlexNet on the ImageNet data set, we achieve a 20.9× weight reduction in only half the time of prior work.
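The two-block splitting described in the abstract can be illustrated with plain, deterministic ADMM on a toy L1-regularized least-squares loss standing in for the DNN loss. This is a minimal sketch under those assumptions, not the paper's SAS-ADMM (which additionally uses symmetric dual updates, acceleration, and stochastic gradients); all names such as `soft_threshold`, `lam`, and `rho` are illustrative.

```python
# Minimal sketch: standard ADMM for min_W 0.5*||A W - b||^2 + lam*||W||_1,
# where the quadratic term is a stand-in for the DNN loss f(W).
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((100, 50))          # toy "data" defining the loss
x_true = np.zeros(50)
x_true[:5] = 1.0                            # sparse ground-truth weights
b = A @ x_true + 0.01 * rng.standard_normal(100)

lam, rho = 0.1, 1.0                         # L1 weight and ADMM penalty
W = np.zeros(50)                            # loss-side variable
Z = np.zeros(50)                            # regularizer-side copy (W = Z)
U = np.zeros(50)                            # scaled dual variable

def soft_threshold(v, t):
    """Proximal operator of t*||.||_1: closed-form Z-subproblem."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

# Normal equations of the W-subproblem:
# argmin_W 0.5*||A W - b||^2 + (rho/2)*||W - Z + U||^2
lhs = A.T @ A + rho * np.eye(50)

for _ in range(200):
    W = np.linalg.solve(lhs, A.T @ b + rho * (Z - U))  # W-update (loss side)
    Z = soft_threshold(W + U, lam / rho)               # Z-update (L1 side)
    U = U + W - Z                                      # dual ascent step

print("nonzero weights after pruning:", np.count_nonzero(np.abs(Z) > 1e-6))
```

In an actual DNN setting, the W-update has no closed form and is instead approximated by a few epochs of (stochastic) gradient descent on the penalized loss, while the Z-update remains a cheap elementwise proximal step; this is the sense in which the splitting yields two relatively easier subproblems.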

Original language: English
Article number: 127327
Journal: Neurocomputing
Volume: 575
DOI
Publication status: Published - 28 Mar 2024
