WMDRS: Workload-Aware Performance Model Based Multi-Task Dynamic-Quota Real-Time Scheduling for Neural Processing Units

Chong Liu, Yuan Yao, Yi Dang, Gang Yang, Wei Jia, Xinyu Tian, Xingshe Zhou

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

2 Citations (Scopus)

Abstract

To improve the capacity of airborne embedded systems to run deep learning (DL) applications and to reduce overall power consumption, such systems need to be equipped with Neural Processing Units (NPUs). Compared with cloud systems, an airborne embedded system usually has a fixed application set but strict real-time constraints. Unfortunately, the native NPU scheduler does not consider application priority and therefore cannot provide sufficient real-time capability for airborne embedded systems. At present, there is little research on multi-task real-time scheduling for NPUs. We therefore propose WMDRS, a workload-aware performance model based multi-task dynamic-quota real-time scheduling approach for NPUs. The workload-aware NPU performance model accurately predicts the remaining execution time of a task that is running concurrently with other tasks on the NPU. The multi-task dynamic-quota real-time scheduling algorithm provides approximate preemption by dynamically adjusting the NPU computing resources allocated to active applications. In addition, we implement a prototype NPU scheduler that requires no hardware extension. The proposed NPU performance model and real-time scheduling algorithm are evaluated on realistic application sets. Experimental results demonstrate that WMDRS achieves low prediction error and a high scheduling success ratio.

Original language: English
Title of host publication: Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022
Publisher: IEEE Computer Society
Pages: 435-442
Number of pages: 8
ISBN (Electronic): 9781665473156
DOI
Publication status: Published - 2023
Event: 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022 - Nanjing, China
Duration: 10 Jan 2023 to 12 Jan 2023

Publication series

Name: Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
Volume: 2023-January
ISSN (Print): 1521-9097

Conference

Conference: 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022
Country/Territory: China
City: Nanjing
Period: 10/01/23 to 12/01/23
