Regularized Training Framework for Combining Pruning and Quantization to Compress Neural Networks

Qimin Ding, Ruonan Zhang, Yi Jiang, Daosen Zhai, Bin Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Many convolutional neural networks(CNNs) have been proposed to solve computer vision tasks such as image classification and image segmentation. However the CNNs usually contain a large number of parameters to determine which consumes very high computation and power resources. Thus, it is difficult to deploy the CNNs on resource-limited devices. Network pruning and network quantization are two main methods to compress the CNNs, researchers often apply these methods individually without considering the relationship between them. In this paper, we explore the coupling relationship between network pruning and quantization, as well as the limits of the current network compression training method. Then we propose a new regularized training method that can combine pruning and quantization within a simple training framework. Experiments show that by using the proposed training framework, the finetune process is not needed anymore and hence we can reduce much time for training a network. The simulation results also show that the performance of the network can over-perform the traditional methods. The proposed framework is suitable for the CNNs deployed in portable devices with limited computational resources and power supply.

Original languageEnglish
Title of host publication2019 11th International Conference on Wireless Communications and Signal Processing, WCSP 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728135557
DOIs
StatePublished - Oct 2019
Event11th International Conference on Wireless Communications and Signal Processing, WCSP 2019 - Xi'an, China
Duration: 23 Oct 201925 Oct 2019

Publication series

Name2019 11th International Conference on Wireless Communications and Signal Processing, WCSP 2019

Conference

Conference11th International Conference on Wireless Communications and Signal Processing, WCSP 2019
Country/TerritoryChina
CityXi'an
Period23/10/1925/10/19

Keywords

  • convolutional neural networks
  • coupling relationship
  • fuzzy rules
  • network compression
  • training framework

Fingerprint

Dive into the research topics of 'Regularized Training Framework for Combining Pruning and Quantization to Compress Neural Networks'. Together they form a unique fingerprint.

Cite this