Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls

Sicong Liu, Wentao Zhou, Zimu Zhou, Bin Guo, Minfan Wang, Cheng Fang, Zheng Lin, Zhiwen Yu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

There is a growing demand to deploy computation-intensive deep learning (DL) models on resource-constrained mobile devices for real-time intelligent applications. Equipped with a variety of processing units such as CPUs, GPUs, and NPUs, the mobile devices hold potential to accelerate DL inference via parallel execution across heterogeneous processors. Various efficient parallel methods have been explored to optimize computation distribution, achieve load balance, and minimize communication cost across processors. Yet their practical effectiveness in the dynamic and diverse real-world mobile environment is less explored. This paper presents a holistic empirical study to assess the capabilities and challenges associated with parallel DL inference on heterogeneous mobile processors. Through carefully designed experiments covering various DL models, mobile software/hardware environments, workload patterns, and resource availability, we identify limitations of existing techniques and highlight opportunities for cross-level optimization.

Original languageEnglish
Title of host publicationAdaAIoTSys 2024 - Proceedings of the 2024 AdaAIoTSys 2024 - Workshop on Adaptive AIoT Systems
PublisherAssociation for Computing Machinery, Inc
Pages1-6
Number of pages6
ISBN (Electronic)9798400706646
DOIs
StatePublished - 3 Jun 2024
Event2024 Workshop on Adaptive AIoT Systems, AdaAIoTSys 2024 - Minato-ku, Japan
Duration: 3 Jun 20247 Jun 2024

Publication series

NameAdaAIoTSys 2024 - Proceedings of the 2024 AdaAIoTSys 2024 - Workshop on Adaptive AIoT Systems

Conference

Conference2024 Workshop on Adaptive AIoT Systems, AdaAIoTSys 2024
Country/TerritoryJapan
CityMinato-ku
Period3/06/247/06/24

Keywords

  • Heterogeneous processors
  • parallel DL inference

Fingerprint

Dive into the research topics of 'Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls'. Together they form a unique fingerprint.

Cite this