Regularized Training Framework for Combining Pruning and Quantization to Compress Neural Networks

Qimin Ding, Ruonan Zhang, Yi Jiang, Daosen Zhai, Bin Li

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

Many convolutional neural networks (CNNs) have been proposed to solve computer vision tasks such as image classification and image segmentation. However, CNNs usually contain a large number of parameters to determine, which consumes substantial computation and power resources. It is therefore difficult to deploy CNNs on resource-limited devices. Network pruning and network quantization are two main methods for compressing CNNs, yet researchers often apply them individually without considering the relationship between them. In this paper, we explore the coupling relationship between network pruning and quantization, as well as the limitations of current network compression training methods. We then propose a new regularized training method that combines pruning and quantization within a single, simple training framework. Experiments show that with the proposed framework the fine-tuning stage is no longer needed, which greatly reduces the time required to train a network. The simulation results also show that the compressed network can outperform those produced by traditional methods. The proposed framework is suitable for CNNs deployed on portable devices with limited computational resources and power supply.

Original language: English
Title of host publication: 2019 11th International Conference on Wireless Communications and Signal Processing, WCSP 2019
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (electronic): 9781728135557
DOI
Publication status: Published - Oct 2019
Event: 11th International Conference on Wireless Communications and Signal Processing, WCSP 2019 - Xi'an, China
Duration: 23 Oct 2019 – 25 Oct 2019

Publication series

Name: 2019 11th International Conference on Wireless Communications and Signal Processing, WCSP 2019

Conference

Conference: 11th International Conference on Wireless Communications and Signal Processing, WCSP 2019
Country/Territory: China
City: Xi'an
Period: 23/10/19 – 25/10/19
