TY - JOUR
T1 - Reweighted Alternating Direction Method of Multipliers for DNN weight pruning
AU - Yuan, Ming
AU - Du, Lin
AU - Jiang, Feng
AU - Bai, Jianchao
AU - Chen, Guanrong
N1 - Publisher Copyright:
© 2024
PY - 2024/11
Y1 - 2024/11
AB - As Deep Neural Networks (DNNs) continue to grow in complexity and size, imposing a substantial computational burden, weight pruning techniques have emerged as an effective remedy. This paper presents a novel dynamic regularization-based pruning method built on the Alternating Direction Method of Multipliers (ADMM). Unlike conventional methods that apply simple, abrupt thresholding, the proposed method introduces a reweighting mechanism that assigns importance to the weights in DNNs. Compared with other ADMM-based methods, the new method not only achieves higher accuracy but also saves considerable time, owing to the reduced number of hyperparameters that must be tuned. The method is evaluated on LeNet-5 (MNIST), ResNet-32 and ResNet-56 (CIFAR-10), and ResNet-50 (ImageNet). Experimental results demonstrate superior compression ratios and accuracy compared with state-of-the-art pruning methods. In particular, on LeNet-5 with the MNIST dataset, it achieves a compression ratio of 355.9× with a slight improvement in accuracy; on ResNet-50 trained on ImageNet, it achieves a compression ratio of 4.24× without sacrificing accuracy.
KW - Alternating direction method of multipliers
KW - Deep neural network
KW - Pruning
KW - Sparsity
UR - http://www.scopus.com/inward/record.url?scp=85199480494&partnerID=8YFLogxK
DO - 10.1016/j.neunet.2024.106534
M3 - Article
C2 - 39059046
AN - SCOPUS:85199480494
SN - 0893-6080
VL - 179
JO - Neural Networks
JF - Neural Networks
M1 - 106534
ER -