跳到主要导航 跳到搜索 跳到主要内容

HCDCMQ: Hessian-aware Channel Determinism-decomposition With Counterfactual Multi-agent Optimization For Channel-wise Mixed-precision Post-training Quantization

  • Wentao Xu
  • , Jiaxiang Wang
  • , Ruize Wang
  • , Xiang Li
  • , Yuxuan Zhang
  • , Xiang Wu
  • , Weilin Li
  • , Fengdong Qu
  • Northwestern Polytechnical University Xian

科研成果: 期刊稿件文章同行评审

摘要

Post training quantization (PTQ) enables efficient deployment of deep neural networks on hardware with limited resources. Channel-wise mixed-precision quantization often yields a superior balance between accuracy and efficiency compared with uniform-bit PTQ. However, this approach encounters major obstacles including an exponentially large configuration space, unstable sensitivity evaluation caused by inter channel coupling, and ambiguous credit attribution during fine grained exploration. A channel-wise mixed precision post training quantization framework named HCDCMQ is introduced, combining Hessian aware Channel Determinism Decomposition (HCDD) with a counterfactual multi agent (COMA) policy gradients method. HCDD constructs a unified measure of channel sensitivity and fixes deterministic channels, thereby shrinking the practical search domain. Channels that remain ambiguous are grouped within a responsibility space to generate a compact discrete action set. With joint optimization targets covering accuracy, model size, and latency, the counterfactual baseline estimates the marginal impact of each agent, which lowers reward variance and improves the stability of policy optimization. Evaluations on ImageNet demonstrate that HCDCMQ achieves superior accuracy and efficiency outcomes relative to uniform bit post training quantization on ResNet 18, ResNet 50, MobileNetV2, InceptionV3, RegNetX 600 M, and RegNetX 3.2 G. On ResNet 50, HCDCMQ attains 75.775% top-1 accuracy while preserving only 13.4% of the FP32 model size and 4.4% of the total bit-operations.

源语言英语
文章编号133191
期刊Neurocomputing
679
DOI
出版状态已出版 - 28 5月 2026

指纹

探究 'HCDCMQ: Hessian-aware Channel Determinism-decomposition With Counterfactual Multi-agent Optimization For Channel-wise Mixed-precision Post-training Quantization' 的科研主题。它们共同构成独一无二的指纹。

引用此