一种基于 LLVM Pass 的复杂嵌套循环自动并行化框架

Translated title of the contribution: An Automatic Parallelization Framework for Complex Nested Loops Based on LLVM Pass

Chun Yan Ma, Bing Xu Lv, Xu Jiao Ye, Yu Zhang

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

With the popularization of multi-core processors, automatic parallelization of serial codes in embedded legacy systems is a research hotspot. Among them, there are technical challenges in the automatic parallelization method for complex nested loops with imperfect nested structure and non-affine dependency characteristics. This paper proposes an automatic parallelization framework (CNLPF) for complex nested loops based on LLVM Pass. Firstly, a representation model of complex nested loops, namely loop structure tree, is proposed, and the regular region of nested loops is automatically converted into a loop structure tree representation. Then, the data dependency analysis is carried out on the loop structure tree to construct intra-loop and inter-loop dependency relationship. Finally, the parallel loop program is generated based on the OpenMP shared memory programming model. For the 6 program cases in the SPEC2006 data set containing nearly 500 complex nested loops, the statistics of the proportion of complex nested loops and the parallel performance acceleration test were carried out respectively. The results show that the automatic parallelization framework proposed in this paper can deal with complex nested loops that cannot be optimized by LLVM Polly, which enhances the parallel compilation and optimization capabilities of LLVM, and the method combined with Polly optimization improves the acceleration effect of Polly optimization alone by 9%~43%.

Translated title of the contributionAn Automatic Parallelization Framework for Complex Nested Loops Based on LLVM Pass
Original languageChinese (Traditional)
JournalRuan Jian Xue Bao/Journal of Software
Volume34
Issue number7
DOIs
StatePublished - 2023

Fingerprint

Dive into the research topics of 'An Automatic Parallelization Framework for Complex Nested Loops Based on LLVM Pass'. Together they form a unique fingerprint.

Cite this