TY - GEN
T1 - Deep Learning Inference on Heterogeneous Mobile Processors
T2 - 2024 Workshop on Adaptive AIoT Systems, AdaAIoTSys 2024
AU - Liu, Sicong
AU - Zhou, Wentao
AU - Zhou, Zimu
AU - Guo, Bin
AU - Wang, Minfan
AU - Fang, Cheng
AU - Lin, Zheng
AU - Yu, Zhiwen
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2024/6/3
Y1 - 2024/6/3
N2 - There is a growing demand to deploy computation-intensive deep learning (DL) models on resource-constrained mobile devices for real-time intelligent applications. Equipped with a variety of processing units such as CPUs, GPUs, and NPUs, mobile devices hold the potential to accelerate DL inference via parallel execution across heterogeneous processors. Various efficient parallel methods have been explored to optimize computation distribution, achieve load balancing, and minimize communication cost across processors. Yet their practical effectiveness in dynamic and diverse real-world mobile environments remains underexplored. This paper presents a holistic empirical study to assess the capabilities and challenges associated with parallel DL inference on heterogeneous mobile processors. Through carefully designed experiments covering various DL models, mobile software/hardware environments, workload patterns, and resource availability, we identify the limitations of existing techniques and highlight opportunities for cross-level optimization.
AB - There is a growing demand to deploy computation-intensive deep learning (DL) models on resource-constrained mobile devices for real-time intelligent applications. Equipped with a variety of processing units such as CPUs, GPUs, and NPUs, mobile devices hold the potential to accelerate DL inference via parallel execution across heterogeneous processors. Various efficient parallel methods have been explored to optimize computation distribution, achieve load balancing, and minimize communication cost across processors. Yet their practical effectiveness in dynamic and diverse real-world mobile environments remains underexplored. This paper presents a holistic empirical study to assess the capabilities and challenges associated with parallel DL inference on heterogeneous mobile processors. Through carefully designed experiments covering various DL models, mobile software/hardware environments, workload patterns, and resource availability, we identify the limitations of existing techniques and highlight opportunities for cross-level optimization.
KW - Heterogeneous processors
KW - parallel DL inference
UR - http://www.scopus.com/inward/record.url?scp=85196550753&partnerID=8YFLogxK
U2 - 10.1145/3662007.3663881
DO - 10.1145/3662007.3663881
M3 - Conference contribution
AN - SCOPUS:85196550753
T3 - AdaAIoTSys 2024 - Proceedings of the 2024 Workshop on Adaptive AIoT Systems
SP - 1
EP - 6
BT - AdaAIoTSys 2024 - Proceedings of the 2024 Workshop on Adaptive AIoT Systems
PB - Association for Computing Machinery, Inc
Y2 - 3 June 2024 through 7 June 2024
ER -