Abstract
Original language | English |
---|---|
Journal | ACM Transactions on Architecture and Code Optimization |
Volume | 14 |
Issue number | 3 |
Pages (from-to) | 1-24 |
Number of pages | 24 |
ISSN | 1544-3566 |
DOIs | |
Publication status | Published - 2017 |
Cite this
}
Improving Loop Dependence Analysis. / Jensen, Nicklas Bo; Karlsson, Sven .
In: ACM Transactions on Architecture and Code Optimization, Vol. 14, No. 3, 2017, p. 1-24.Research output: Contribution to journal › Journal article › Research › peer-review
TY - JOUR
T1 - Improving Loop Dependence Analysis
AU - Jensen, Nicklas Bo
AU - Karlsson, Sven
PY - 2017
Y1 - 2017
N2 - Programmers can no longer depend on new processors to have significantly improved single-thread performance. Instead, gains have to come from other sources such as the compiler and its optimization passes. Advanced passes make use of information on the dependencies related to loops. We improve the quality of that information by reusing the information given by the programmer for parallelization. We have implemented a prototype based on GCC into which we also add a new optimization pass. Our approach improves the amount of correctly classified dependencies resulting in 46% average improvement in single-thread performance for kernel benchmarks compared to GCC 6.1.
AB - Programmers can no longer depend on new processors to have significantly improved single-thread performance. Instead, gains have to come from other sources such as the compiler and its optimization passes. Advanced passes make use of information on the dependencies related to loops. We improve the quality of that information by reusing the information given by the programmer for parallelization. We have implemented a prototype based on GCC into which we also add a new optimization pass. Our approach improves the amount of correctly classified dependencies resulting in 46% average improvement in single-thread performance for kernel benchmarks compared to GCC 6.1.
U2 - 10.1145/3095754
DO - 10.1145/3095754
M3 - Journal article
VL - 14
SP - 1
EP - 24
JO - ACM Transactions on Architecture and Code Optimization
JF - ACM Transactions on Architecture and Code Optimization
SN - 1544-3566
IS - 3
ER -