CAQ: Toward Context-Aware and Self-Adaptive Deep Model Computation for AIoT Applications

Sicong Liu; Yungang Wu; Bin Guo; Yuzhan Wang; Ke Ma; Liyao Xiang; Zhetao Li; Zhiwen Yu

doi:10.1109/JIOT.2022.3176136

CAQ: Toward Context-Aware and Self-Adaptive Deep Model Computation for AIoT Applications

Sicong Liu, Yungang Wu, Bin Guo, Yuzhan Wang, Ke Ma, Liyao Xiang, Zhetao Li, Zhiwen Yu

School of Computer Science

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Artificial Intelligence of Things (AIoT) has recently accepted significant interests. Remarkably, embedded artificial intelligence (e.g., deep learning) on-device transforms IoT devices into intelligent systems that robustly and privately process data. Quantization technique is widely used to compress deep models for narrowing the resource gap between computation demands and platform supply. However, existing quantization schemes induce unsatisfaction for IoT scenarios since they are oblivious to dynamic changes of application context (e.g., battery and hierarchical memory availability) during the long-term operation. Subsequently, they will mismatch the user-desired resource efficiency and application lifetime. Also, to adapt to the dynamic context, we can neither accept the latency for model retraining with existing hand-crafted quantization nor the overhead for quantization bit width researching with prior on-demand quantization. This article presents a context-aware and self-adaptive deep model quantization (CAQ) system for IoT application scenarios. CAQ integrates a novel switchable multigate quantization framework, optimizing the quantized model accuracy and energy efficiency in diverse contexts. Based on the learned model, CAQ can switch among different gating networks in a context-aware manner and then adopt it to automatically capture the representation importance of various layers for optimal quantization bit-width selection. The experimental results show that CAQ achieves up to 50% storage savings with even 2.61% higher accuracy than the state-of-the-art baselines.

Original language	English
Pages (from-to)	20801-20814
Number of pages	14
Journal	IEEE Internet of Things Journal
Volume	9
Issue number	21
DOIs	https://doi.org/10.1109/JIOT.2022.3176136
State	Published - 1 Nov 2022

Keywords

Context-aware adaptation
deep model quantization
IoT applications
on-device intelligence

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/JIOT.2022.3176136

Cite this

@article{d69cda9bf8ea41b49125882e2fccbb09,

title = "CAQ: Toward Context-Aware and Self-Adaptive Deep Model Computation for AIoT Applications",

abstract = "Artificial Intelligence of Things (AIoT) has recently accepted significant interests. Remarkably, embedded artificial intelligence (e.g., deep learning) on-device transforms IoT devices into intelligent systems that robustly and privately process data. Quantization technique is widely used to compress deep models for narrowing the resource gap between computation demands and platform supply. However, existing quantization schemes induce unsatisfaction for IoT scenarios since they are oblivious to dynamic changes of application context (e.g., battery and hierarchical memory availability) during the long-term operation. Subsequently, they will mismatch the user-desired resource efficiency and application lifetime. Also, to adapt to the dynamic context, we can neither accept the latency for model retraining with existing hand-crafted quantization nor the overhead for quantization bit width researching with prior on-demand quantization. This article presents a context-aware and self-adaptive deep model quantization (CAQ) system for IoT application scenarios. CAQ integrates a novel switchable multigate quantization framework, optimizing the quantized model accuracy and energy efficiency in diverse contexts. Based on the learned model, CAQ can switch among different gating networks in a context-aware manner and then adopt it to automatically capture the representation importance of various layers for optimal quantization bit-width selection. The experimental results show that CAQ achieves up to 50% storage savings with even 2.61% higher accuracy than the state-of-the-art baselines.",

keywords = "Context-aware adaptation, deep model quantization, IoT applications, on-device intelligence",

author = "Sicong Liu and Yungang Wu and Bin Guo and Yuzhan Wang and Ke Ma and Liyao Xiang and Zhetao Li and Zhiwen Yu",

note = "Publisher Copyright: {\textcopyright} 2014 IEEE.",

year = "2022",

month = nov,

day = "1",

doi = "10.1109/JIOT.2022.3176136",

language = "英语",

volume = "9",

pages = "20801--20814",

journal = "IEEE Internet of Things Journal",

issn = "2327-4662",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "21",

}

TY - JOUR

T1 - CAQ

T2 - Toward Context-Aware and Self-Adaptive Deep Model Computation for AIoT Applications

AU - Liu, Sicong

AU - Wu, Yungang

AU - Guo, Bin

AU - Wang, Yuzhan

AU - Ma, Ke

AU - Xiang, Liyao

AU - Li, Zhetao

AU - Yu, Zhiwen

PY - 2022/11/1

Y1 - 2022/11/1

N2 - Artificial Intelligence of Things (AIoT) has recently accepted significant interests. Remarkably, embedded artificial intelligence (e.g., deep learning) on-device transforms IoT devices into intelligent systems that robustly and privately process data. Quantization technique is widely used to compress deep models for narrowing the resource gap between computation demands and platform supply. However, existing quantization schemes induce unsatisfaction for IoT scenarios since they are oblivious to dynamic changes of application context (e.g., battery and hierarchical memory availability) during the long-term operation. Subsequently, they will mismatch the user-desired resource efficiency and application lifetime. Also, to adapt to the dynamic context, we can neither accept the latency for model retraining with existing hand-crafted quantization nor the overhead for quantization bit width researching with prior on-demand quantization. This article presents a context-aware and self-adaptive deep model quantization (CAQ) system for IoT application scenarios. CAQ integrates a novel switchable multigate quantization framework, optimizing the quantized model accuracy and energy efficiency in diverse contexts. Based on the learned model, CAQ can switch among different gating networks in a context-aware manner and then adopt it to automatically capture the representation importance of various layers for optimal quantization bit-width selection. The experimental results show that CAQ achieves up to 50% storage savings with even 2.61% higher accuracy than the state-of-the-art baselines.

AB - Artificial Intelligence of Things (AIoT) has recently accepted significant interests. Remarkably, embedded artificial intelligence (e.g., deep learning) on-device transforms IoT devices into intelligent systems that robustly and privately process data. Quantization technique is widely used to compress deep models for narrowing the resource gap between computation demands and platform supply. However, existing quantization schemes induce unsatisfaction for IoT scenarios since they are oblivious to dynamic changes of application context (e.g., battery and hierarchical memory availability) during the long-term operation. Subsequently, they will mismatch the user-desired resource efficiency and application lifetime. Also, to adapt to the dynamic context, we can neither accept the latency for model retraining with existing hand-crafted quantization nor the overhead for quantization bit width researching with prior on-demand quantization. This article presents a context-aware and self-adaptive deep model quantization (CAQ) system for IoT application scenarios. CAQ integrates a novel switchable multigate quantization framework, optimizing the quantized model accuracy and energy efficiency in diverse contexts. Based on the learned model, CAQ can switch among different gating networks in a context-aware manner and then adopt it to automatically capture the representation importance of various layers for optimal quantization bit-width selection. The experimental results show that CAQ achieves up to 50% storage savings with even 2.61% higher accuracy than the state-of-the-art baselines.

KW - Context-aware adaptation

KW - deep model quantization

KW - IoT applications

KW - on-device intelligence

UR - http://www.scopus.com/inward/record.url?scp=85130472628&partnerID=8YFLogxK

U2 - 10.1109/JIOT.2022.3176136

DO - 10.1109/JIOT.2022.3176136

M3 - 文章

AN - SCOPUS:85130472628

SN - 2327-4662

VL - 9

SP - 20801

EP - 20814

JO - IEEE Internet of Things Journal

JF - IEEE Internet of Things Journal

IS - 21

ER -

CAQ: Toward Context-Aware and Self-Adaptive Deep Model Computation for AIoT Applications

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this