Comрuting Caрability Evaluation Method of Embedded Intelligent Comрuter

Chun Yan Ma; Jing Chen; Ding Yao; Tao Zhang

doi:10.11897/SP.J.1016.2023.02279

Comрuting Caрability Evaluation Method of Embedded Intelligent Comрuter

投稿的翻译标题: 嵌入式智能计算机计算能力评测方法

Chun Yan Ma, Jing Chen, Ding Yao, Tao Zhang

Northwestern Polyteсhniсal University

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

Computing capability evaluation is one of the research hotspots in the field of embedded intelligent computing. It is an important means to test the execution speed and performance of intelligent computers when performing inference tasks. The rationality of its evaluation results directly affects the optimization and improvement direction of embedded intelligent computers. Due to the various optimization schemes for neural form computing and machine learning accelerators, the evaluation of embedded intelligent computers is more difficult than that of general-purpose computers. Benchmark testing is currently a commonly used evaluation method, but on resource-limited embedded devices, the reuse ability of benchmark test sets and evaluation indicators is limited, making it difficult to adapt to the diversified configuration of embedded intelligent systems. The computational intensity of the neural network models in the test set has a certain randomness, which cannot fully explore the computing potential of the device under test, and the evaluation indicators are not unified, making it difficult to compare and analyze the computing capabilities of different embedded intelligent computers. To solve the problem that embedded intelligent devices have different configurations and cannot be performance tested through fixed models, this paper proposes a neural network model generation algorithm based on neural evolution algorithm to generate neural network models that can characterize the intelligent computing capabilities of embedded intelligent devices, and inversely infer the intelligent computing capabilities of embedded intelligent computers based on the complexity of the model. Firstly, based on the Roofline theory model, the advantages of integrating computing potential mining, resource adaptation, and unified evaluation indicators are used to propose a computing capability evaluation framework that can adapt to various embedded intelligent computers, and its rationality is analyzed. Secondly, a neural network model generation algorithm with the goal of generating the maximum complexity model is proposed. By using the neural evolution algorithm, the computational intensity of the generated model approaches the upper limit of the computational intensity of the embedded intelligent computer, fully exploring the computing potential of the device under test, and making the evaluation results more objective. Then, using a fixed upper machine as a reference, the generated neural network model is cross-operated between the device under test and the upper machine. The floating-point operation per second during the two inference task executions is used as the calculation factor to provide a general formula for computing capability evaluation, which can realize the comparative analysis of computing capabilities of different embedded intelligent computers. Finally, in the Mindspore-cpu, Tensorflow-cpu, and Mindspore-ascend310 frameworks, Huawei Atlas200 is evaluated. Compared with the five neural network models commonly used in benchmark tests, the evaluation results of the neural network models generated in this paper are more reasonable, proving that the intelligent computing capability of two DaVinci cores is 42.37 times that of eight Cortex-A55 cores. The experimental results are within the theoretical expected range, indicating that the model generation method based on neural evolution algorithm proposed in this paper exhibits stable evaluation effects under different computing frameworks. In summary, the computing capability evaluation method for embedded intelligent computers proposed in this paper mainly addresses issues such as resource adaptation, computing potential mining, neural model structural changes, and unified performance evaluation indicators during the evaluation process. It can improve the accuracy of computing capability evaluation for embedded intelligent computers and shorten the product evaluation cycle.

投稿的翻译标题	嵌入式智能计算机计算能力评测方法
源语言	英语
页（从-至）	2279-2301
页数	23
期刊	Jisuanji Xuebao/Chinese Journal of Computers
卷	46
期	11
DOI	https://doi.org/10.11897/SP.J.1016.2023.02279
出版状态	已出版 - 11月 2023
已对外发布	是

访问文件

10.11897/SP.J.1016.2023.02279

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{3b6e21f3e1b848b3b841fe038674c3b5,

title = "Comрuting Caрability Evaluation Method of Embedded Intelligent Comрuter",

abstract = "Computing capability evaluation is one of the research hotspots in the field of embedded intelligent computing. It is an important means to test the execution speed and performance of intelligent computers when performing inference tasks. The rationality of its evaluation results directly affects the optimization and improvement direction of embedded intelligent computers. Due to the various optimization schemes for neural form computing and machine learning accelerators, the evaluation of embedded intelligent computers is more difficult than that of general-purpose computers. Benchmark testing is currently a commonly used evaluation method, but on resource-limited embedded devices, the reuse ability of benchmark test sets and evaluation indicators is limited, making it difficult to adapt to the diversified configuration of embedded intelligent systems. The computational intensity of the neural network models in the test set has a certain randomness, which cannot fully explore the computing potential of the device under test, and the evaluation indicators are not unified, making it difficult to compare and analyze the computing capabilities of different embedded intelligent computers. To solve the problem that embedded intelligent devices have different configurations and cannot be performance tested through fixed models, this paper proposes a neural network model generation algorithm based on neural evolution algorithm to generate neural network models that can characterize the intelligent computing capabilities of embedded intelligent devices, and inversely infer the intelligent computing capabilities of embedded intelligent computers based on the complexity of the model. Firstly, based on the Roofline theory model, the advantages of integrating computing potential mining, resource adaptation, and unified evaluation indicators are used to propose a computing capability evaluation framework that can adapt to various embedded intelligent computers, and its rationality is analyzed. Secondly, a neural network model generation algorithm with the goal of generating the maximum complexity model is proposed. By using the neural evolution algorithm, the computational intensity of the generated model approaches the upper limit of the computational intensity of the embedded intelligent computer, fully exploring the computing potential of the device under test, and making the evaluation results more objective. Then, using a fixed upper machine as a reference, the generated neural network model is cross-operated between the device under test and the upper machine. The floating-point operation per second during the two inference task executions is used as the calculation factor to provide a general formula for computing capability evaluation, which can realize the comparative analysis of computing capabilities of different embedded intelligent computers. Finally, in the Mindspore-cpu, Tensorflow-cpu, and Mindspore-ascend310 frameworks, Huawei Atlas200 is evaluated. Compared with the five neural network models commonly used in benchmark tests, the evaluation results of the neural network models generated in this paper are more reasonable, proving that the intelligent computing capability of two DaVinci cores is 42.37 times that of eight Cortex-A55 cores. The experimental results are within the theoretical expected range, indicating that the model generation method based on neural evolution algorithm proposed in this paper exhibits stable evaluation effects under different computing frameworks. In summary, the computing capability evaluation method for embedded intelligent computers proposed in this paper mainly addresses issues such as resource adaptation, computing potential mining, neural model structural changes, and unified performance evaluation indicators during the evaluation process. It can improve the accuracy of computing capability evaluation for embedded intelligent computers and shorten the product evaluation cycle.",

keywords = "Atlas200, computing capability evaluation, embedded intelligent computer, neural evolutionary algorithm, neural network models",

author = "Ma, {Chun Yan} and Jing Chen and Ding Yao and Tao Zhang",

year = "2023",

month = nov,

doi = "10.11897/SP.J.1016.2023.02279",

language = "英语",

volume = "46",

pages = "2279--2301",

journal = "Jisuanji Xuebao/Chinese Journal of Computers",

issn = "0254-4164",

publisher = "Science Press ",

number = "11",

}

TY - JOUR

T1 - Comрuting Caрability Evaluation Method of Embedded Intelligent Comрuter

AU - Ma, Chun Yan

AU - Chen, Jing

AU - Yao, Ding

AU - Zhang, Tao

PY - 2023/11

Y1 - 2023/11

N2 - Computing capability evaluation is one of the research hotspots in the field of embedded intelligent computing. It is an important means to test the execution speed and performance of intelligent computers when performing inference tasks. The rationality of its evaluation results directly affects the optimization and improvement direction of embedded intelligent computers. Due to the various optimization schemes for neural form computing and machine learning accelerators, the evaluation of embedded intelligent computers is more difficult than that of general-purpose computers. Benchmark testing is currently a commonly used evaluation method, but on resource-limited embedded devices, the reuse ability of benchmark test sets and evaluation indicators is limited, making it difficult to adapt to the diversified configuration of embedded intelligent systems. The computational intensity of the neural network models in the test set has a certain randomness, which cannot fully explore the computing potential of the device under test, and the evaluation indicators are not unified, making it difficult to compare and analyze the computing capabilities of different embedded intelligent computers. To solve the problem that embedded intelligent devices have different configurations and cannot be performance tested through fixed models, this paper proposes a neural network model generation algorithm based on neural evolution algorithm to generate neural network models that can characterize the intelligent computing capabilities of embedded intelligent devices, and inversely infer the intelligent computing capabilities of embedded intelligent computers based on the complexity of the model. Firstly, based on the Roofline theory model, the advantages of integrating computing potential mining, resource adaptation, and unified evaluation indicators are used to propose a computing capability evaluation framework that can adapt to various embedded intelligent computers, and its rationality is analyzed. Secondly, a neural network model generation algorithm with the goal of generating the maximum complexity model is proposed. By using the neural evolution algorithm, the computational intensity of the generated model approaches the upper limit of the computational intensity of the embedded intelligent computer, fully exploring the computing potential of the device under test, and making the evaluation results more objective. Then, using a fixed upper machine as a reference, the generated neural network model is cross-operated between the device under test and the upper machine. The floating-point operation per second during the two inference task executions is used as the calculation factor to provide a general formula for computing capability evaluation, which can realize the comparative analysis of computing capabilities of different embedded intelligent computers. Finally, in the Mindspore-cpu, Tensorflow-cpu, and Mindspore-ascend310 frameworks, Huawei Atlas200 is evaluated. Compared with the five neural network models commonly used in benchmark tests, the evaluation results of the neural network models generated in this paper are more reasonable, proving that the intelligent computing capability of two DaVinci cores is 42.37 times that of eight Cortex-A55 cores. The experimental results are within the theoretical expected range, indicating that the model generation method based on neural evolution algorithm proposed in this paper exhibits stable evaluation effects under different computing frameworks. In summary, the computing capability evaluation method for embedded intelligent computers proposed in this paper mainly addresses issues such as resource adaptation, computing potential mining, neural model structural changes, and unified performance evaluation indicators during the evaluation process. It can improve the accuracy of computing capability evaluation for embedded intelligent computers and shorten the product evaluation cycle.

AB - Computing capability evaluation is one of the research hotspots in the field of embedded intelligent computing. It is an important means to test the execution speed and performance of intelligent computers when performing inference tasks. The rationality of its evaluation results directly affects the optimization and improvement direction of embedded intelligent computers. Due to the various optimization schemes for neural form computing and machine learning accelerators, the evaluation of embedded intelligent computers is more difficult than that of general-purpose computers. Benchmark testing is currently a commonly used evaluation method, but on resource-limited embedded devices, the reuse ability of benchmark test sets and evaluation indicators is limited, making it difficult to adapt to the diversified configuration of embedded intelligent systems. The computational intensity of the neural network models in the test set has a certain randomness, which cannot fully explore the computing potential of the device under test, and the evaluation indicators are not unified, making it difficult to compare and analyze the computing capabilities of different embedded intelligent computers. To solve the problem that embedded intelligent devices have different configurations and cannot be performance tested through fixed models, this paper proposes a neural network model generation algorithm based on neural evolution algorithm to generate neural network models that can characterize the intelligent computing capabilities of embedded intelligent devices, and inversely infer the intelligent computing capabilities of embedded intelligent computers based on the complexity of the model. Firstly, based on the Roofline theory model, the advantages of integrating computing potential mining, resource adaptation, and unified evaluation indicators are used to propose a computing capability evaluation framework that can adapt to various embedded intelligent computers, and its rationality is analyzed. Secondly, a neural network model generation algorithm with the goal of generating the maximum complexity model is proposed. By using the neural evolution algorithm, the computational intensity of the generated model approaches the upper limit of the computational intensity of the embedded intelligent computer, fully exploring the computing potential of the device under test, and making the evaluation results more objective. Then, using a fixed upper machine as a reference, the generated neural network model is cross-operated between the device under test and the upper machine. The floating-point operation per second during the two inference task executions is used as the calculation factor to provide a general formula for computing capability evaluation, which can realize the comparative analysis of computing capabilities of different embedded intelligent computers. Finally, in the Mindspore-cpu, Tensorflow-cpu, and Mindspore-ascend310 frameworks, Huawei Atlas200 is evaluated. Compared with the five neural network models commonly used in benchmark tests, the evaluation results of the neural network models generated in this paper are more reasonable, proving that the intelligent computing capability of two DaVinci cores is 42.37 times that of eight Cortex-A55 cores. The experimental results are within the theoretical expected range, indicating that the model generation method based on neural evolution algorithm proposed in this paper exhibits stable evaluation effects under different computing frameworks. In summary, the computing capability evaluation method for embedded intelligent computers proposed in this paper mainly addresses issues such as resource adaptation, computing potential mining, neural model structural changes, and unified performance evaluation indicators during the evaluation process. It can improve the accuracy of computing capability evaluation for embedded intelligent computers and shorten the product evaluation cycle.

KW - Atlas200

KW - computing capability evaluation

KW - embedded intelligent computer

KW - neural evolutionary algorithm

KW - neural network models

UR - http://www.scopus.com/inward/record.url?scp=85175739777&partnerID=8YFLogxK

U2 - 10.11897/SP.J.1016.2023.02279

DO - 10.11897/SP.J.1016.2023.02279

M3 - 文章

AN - SCOPUS:85175739777

SN - 0254-4164

VL - 46

SP - 2279

EP - 2301

JO - Jisuanji Xuebao/Chinese Journal of Computers

JF - Jisuanji Xuebao/Chinese Journal of Computers

IS - 11

ER -

Comрuting Caрability Evaluation Method of Embedded Intelligent Comрuter

摘要

访问文件

其它文件与链接

指纹

引用此