边端融合的终端情境自适应深度感知模型

Hong Li Wang; Bin Guo; Si Cong Liu; Jia Qi Liu; Yun Gang Wu; Zhi Wen Yu

doi:10.3785/j.issn.1008-973X.2021.04.004

边端融合的终端情境自适应深度感知模型

Hong Li Wang, Bin Guo, Si Cong Liu, Jia Qi Liu, Yun Gang Wu, Zhi Wen Yu

计算机学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

The end context adaptative of deep models with edge-end collaboration was analyzed. The partition and alternating direction method of multiplier method (X-ADMM) was proposed. The model compression was employed to simplify the model structure, and the model was partitioned at layer granularity to find the best partition point. The model can collaborate with edge-end devices to improve model operation efficiency. The graph based adaptive DNN surgery algorithm (GADS) was proposed in order to realize the dynamic adaptation of model partition. The model will preferentially search for the partition point that best meets resource constraints among surrounding partition states to achieve rapid adaptation when the running context (e.g., storage, power, bandwidth) of the model changes. The experimental results showed that the model realized the adaptive tuning of model partition point in an average of 0.1 ms. The total running latency was reduced by 56.65% at the highest with no more than 2.5% accuracy loss.

投稿的翻译标题	End context-adaptative deep sensing model with edge-end collaboration
源语言	繁体中文
页（从-至）	626-638
页数	13
期刊	Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science)
卷	55
期	4
DOI	https://doi.org/10.3785/j.issn.1008-973X.2021.04.004
出版状态	已出版 - 4月 2021

关键词

Adaptive perception
Deep learning
Edge intelligence
Model compression
Model partition

访问文件

10.3785/j.issn.1008-973X.2021.04.004

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{c924a2b4ea3a45aabb0285bf9a8331c7,

title = "边端融合的终端情境自适应深度感知模型",

abstract = "The end context adaptative of deep models with edge-end collaboration was analyzed. The partition and alternating direction method of multiplier method (X-ADMM) was proposed. The model compression was employed to simplify the model structure, and the model was partitioned at layer granularity to find the best partition point. The model can collaborate with edge-end devices to improve model operation efficiency. The graph based adaptive DNN surgery algorithm (GADS) was proposed in order to realize the dynamic adaptation of model partition. The model will preferentially search for the partition point that best meets resource constraints among surrounding partition states to achieve rapid adaptation when the running context (e.g., storage, power, bandwidth) of the model changes. The experimental results showed that the model realized the adaptive tuning of model partition point in an average of 0.1 ms. The total running latency was reduced by 56.65% at the highest with no more than 2.5% accuracy loss.",

keywords = "Adaptive perception, Deep learning, Edge intelligence, Model compression, Model partition",

author = "Wang, {Hong Li} and Bin Guo and Liu, {Si Cong} and Liu, {Jia Qi} and Wu, {Yun Gang} and Yu, {Zhi Wen}",

year = "2021",

month = apr,

doi = "10.3785/j.issn.1008-973X.2021.04.004",

language = "繁体中文",

volume = "55",

pages = "626--638",

journal = "Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science)",

issn = "1008-973X",

publisher = "Zhejiang University Press",

number = "4",

}

TY - JOUR

T1 - 边端融合的终端情境自适应深度感知模型

AU - Wang, Hong Li

AU - Guo, Bin

AU - Liu, Si Cong

AU - Liu, Jia Qi

AU - Wu, Yun Gang

AU - Yu, Zhi Wen

PY - 2021/4

Y1 - 2021/4

N2 - The end context adaptative of deep models with edge-end collaboration was analyzed. The partition and alternating direction method of multiplier method (X-ADMM) was proposed. The model compression was employed to simplify the model structure, and the model was partitioned at layer granularity to find the best partition point. The model can collaborate with edge-end devices to improve model operation efficiency. The graph based adaptive DNN surgery algorithm (GADS) was proposed in order to realize the dynamic adaptation of model partition. The model will preferentially search for the partition point that best meets resource constraints among surrounding partition states to achieve rapid adaptation when the running context (e.g., storage, power, bandwidth) of the model changes. The experimental results showed that the model realized the adaptive tuning of model partition point in an average of 0.1 ms. The total running latency was reduced by 56.65% at the highest with no more than 2.5% accuracy loss.

AB - The end context adaptative of deep models with edge-end collaboration was analyzed. The partition and alternating direction method of multiplier method (X-ADMM) was proposed. The model compression was employed to simplify the model structure, and the model was partitioned at layer granularity to find the best partition point. The model can collaborate with edge-end devices to improve model operation efficiency. The graph based adaptive DNN surgery algorithm (GADS) was proposed in order to realize the dynamic adaptation of model partition. The model will preferentially search for the partition point that best meets resource constraints among surrounding partition states to achieve rapid adaptation when the running context (e.g., storage, power, bandwidth) of the model changes. The experimental results showed that the model realized the adaptive tuning of model partition point in an average of 0.1 ms. The total running latency was reduced by 56.65% at the highest with no more than 2.5% accuracy loss.

KW - Adaptive perception

KW - Deep learning

KW - Edge intelligence

KW - Model compression

KW - Model partition

UR - http://www.scopus.com/inward/record.url?scp=85105272293&partnerID=8YFLogxK

U2 - 10.3785/j.issn.1008-973X.2021.04.004

DO - 10.3785/j.issn.1008-973X.2021.04.004

M3 - 文章

AN - SCOPUS:85105272293

SN - 1008-973X

VL - 55

SP - 626

EP - 638

JO - Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science)

JF - Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science)

IS - 4

ER -

边端融合的终端情境自适应深度感知模型

摘要

关键词

访问文件

其它文件与链接

指纹

引用此