Compression Method for Convolution Kernel with KCP-FGTD

Rui Lv, Jiangbin Zheng, Siyu Tao, Chenyu Zhang, Chao Hu

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Convolutional neural networks (CNNs) possess strong expressive capabilities, but with the increasing size and cost of modern CNNs, deploying large-scale CNNs in resource-limited environments remains a challenge, making neural network compression a hot topic in contemporary research. Among various compression methods, tensor decomposition shows numerous unique advantages, such as regular data structure, solid mathematical foundation, convenience in training from scratch, and ideal compression rates. However, most existing tensor decompositions do not perform well in compressing CNNs, struggling to balance time and space complexity, and failing to accelerate CNNs in reality. To address these issues. In this paper, we propose a fine-grained KCP tensor decomposition for CNN convolution kernel compression with considerable compression rate and acceleration capability. First, convolution kernels are compressed using 2nd-order KCP decomposition. The decomposed factor tensors are then converted into a four-layer lightweight convolution kernel structure. This structure conducts continuous contraction computations with the input, transforming the convolution computations into a highly efficient lightweight layered matrix multiplication structure. Secondly, we conduct a comprehensive comparison with other methods and analyze the superiority of our method. Comparisons of computational complexity with other typical tensor compression methods, along with simulation experiments on CIFAR-10 and ImageNet datasets, demonstrate that our method can achieve more efficient comprehensive results. These include acceptable accuracy, significantly reduced model size, and real-world acceleration of network inference times. Finally, we further analyze how different combinations of KT rank and CP rank impact the compression effectiveness and expressive capacity of CNNs.

源语言英语
主期刊名2024 IEEE 12th International Conference on Information and Communication Networks, ICICN 2024
出版商Institute of Electrical and Electronics Engineers Inc.
578-583
页数6
ISBN(电子版)9798350355802
DOI
出版状态已出版 - 2024
活动12th IEEE International Conference on Information and Communication Networks, ICICN 2024 - Guilin, 中国
期限: 21 8月 202424 8月 2024

出版系列

姓名2024 IEEE 12th International Conference on Information and Communication Networks, ICICN 2024

会议

会议12th IEEE International Conference on Information and Communication Networks, ICICN 2024
国家/地区中国
Guilin
时期21/08/2424/08/24

指纹

探究 'Compression Method for Convolution Kernel with KCP-FGTD' 的科研主题。它们共同构成独一无二的指纹。

引用此