Abstract
Network pruning has become a popular method to reduce the storage and computational complexity of deep neural networks. To minimize the performance loss, soft pruning retains a large model capacity by setting unimportant weights to zero while still allowing them to be updated. However, these weights are difficult to reactivate due to their small magnitude and frequent resets. In this paper, we propose a novel method, termed RUFP, to reinitialize unimportant filters according to the most important one, which not only gives these filters a chance to be reactivated, but also introduces more filter forms that may win the initialization lottery. By gradually increasing the reinitialization ratio and decreasing the reassigned values of the scaling factors in the batch normalization layer, soft pruning is achieved. Benefiting from the large model capacity and multiple reinitializations, the compressed model achieves superior performance after fine-tuning. Extensive experiments demonstrate the effectiveness of this method in improving the accuracy of the pruned model. The accuracy of ResNet-56 on CIFAR-10 is improved from 93.05% to 93.17% while reducing computations by 57.7% and parameters by 58.8%. Compared with the traditional soft pruning method and other state-of-the-art methods, our RUFP obtains outstanding performance at various compression levels.
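To make the reinitialization step more concrete, the sketch below gives one plausible PyTorch interpretation of the idea described in the abstract (it is not the authors' code): at each pruning epoch, the least important filters of a Conv-BN pair are overwritten with the most important filter, and their batch-norm scaling factors are reassigned a small value that shrinks over epochs while the reinitialization ratio grows. The function name `reinit_filters`, the L1-norm importance criterion, and the schedule are assumptions for illustration.

```python
import torch
import torch.nn as nn

def reinit_filters(conv: nn.Conv2d, bn: nn.BatchNorm2d,
                   reinit_ratio: float, bn_value: float) -> None:
    """Reinitialize the least important filters from the most important one.

    Importance is measured here by the L1 norm of each filter (an assumption;
    the paper may use a different criterion). `reinit_ratio` is the fraction of
    filters to reinitialize; `bn_value` is the value reassigned to the
    corresponding batch-normalization scaling factors.
    """
    with torch.no_grad():
        # Per-filter importance: L1 norm over (in_channels, kH, kW).
        importance = conv.weight.abs().sum(dim=(1, 2, 3))
        num_reinit = int(reinit_ratio * conv.out_channels)
        if num_reinit == 0:
            return
        # Indices of the least important filters and of the single most important one.
        unimportant = torch.argsort(importance)[:num_reinit]
        most_important = torch.argmax(importance)
        # Copy the most important filter into the unimportant slots,
        # giving those filters a chance to be reactivated during training.
        conv.weight[unimportant] = conv.weight[most_important].clone()
        # Reassign small BN scaling factors (and zero shifts) for these filters.
        bn.weight[unimportant] = bn_value
        bn.bias[unimportant] = 0.0

# Hypothetical soft-pruning schedule: the ratio grows and the BN value shrinks.
# for epoch in range(num_epochs):
#     ratio = target_ratio * (epoch + 1) / num_epochs
#     bn_value = initial_value * (1 - epoch / num_epochs)
#     for conv, bn in conv_bn_pairs(model):   # conv_bn_pairs is a hypothetical helper
#         reinit_filters(conv, bn, ratio, bn_value)
#     train_one_epoch(model)
```

Because the reinitialized filters keep non-zero weights and a small but trainable BN scale, the pruning remains "soft": gradients can still revive them before the final compression ratio is reached.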
Original language | English |
---|---|
Pages (from-to) | 311-321 |
Number of pages | 11 |
Journal | Neurocomputing |
Volume | 483 |
DOIs | |
State | Published - 28 Apr 2022 |
Keywords
- Model compression
- Network pruning
- Soft pruning
- Weight reinitialization