Auto-tuning techniques for linear algebra routines on hybrid platforms

Gregorio Bernabé, Javier Cuenca, Luis-Pedro García, Domingo Giménez

2015

Abstract This work analyses two techniques for auto-tuning linear algebra routines on hybrid platforms that combine multicore CPUs with manycore coprocessors (one or more GPUs and MIC). The first technique relies on basic models of the routines' execution time, whereas the second uses only empirical information gathered when the routines are installed. In both cases the goal is a balanced assignment of work to the computing components of the system. The study uses a basic kernel (matrix–matrix multiplication) and a higher-level routine (LU factorization) that calls the auto-tuned kernel. The results are satisfactory: experimental execution times are close to the lowest experimentally achievable.
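
The idea of a balanced assignment of work can be illustrated with a minimal sketch. It is not the authors' model or implementation: it assumes a deliberately simple cost model in which each computing component i sustains a constant rate r_i (flops per second) on matrix multiplication, and it splits the columns of B in C = A·B in proportion to those rates so that all components are predicted to finish at about the same time. The function names (balanced_split, predicted_time) and the example rates are hypothetical.

```python
def balanced_split(n_cols, rates):
    """Split n_cols columns among devices in proportion to their rates.

    rates: assumed sustained flops/s per device (hypothetical values,
    e.g. measured once at installation time).
    Returns a list of integer column counts that sums to n_cols.
    """
    if n_cols < 0 or not rates or any(r <= 0 for r in rates):
        raise ValueError("need n_cols >= 0 and strictly positive rates")
    total = sum(rates)
    exact = [n_cols * r / total for r in rates]   # ideal real-valued shares
    shares = [int(x) for x in exact]              # round down first
    leftover = n_cols - sum(shares)
    # Give the remaining columns to the largest fractional parts.
    order = sorted(range(len(rates)),
                   key=lambda i: exact[i] - shares[i], reverse=True)
    for i in order[:leftover]:
        shares[i] += 1
    return shares


def predicted_time(m, k, shares, rates):
    """Predicted time of the split product under the assumed model.

    Device i multiplies an m x k block by a k x shares[i] block
    (2*m*k*shares[i] flops); the total time is that of the slowest device.
    """
    return max(2.0 * m * k * s / r for s, r in zip(shares, rates))


if __name__ == "__main__":
    rates = [100.0, 300.0, 600.0]   # hypothetical CPU, GPU, MIC rates
    shares = balanced_split(1000, rates)
    print(shares, predicted_time(1, 1, shares, rates))
```

With these assumed rates, the faster components receive proportionally more columns and the predicted per-component times coincide. A real hybrid platform would also need to account for data transfer costs and for rates that vary with problem size, which this sketch ignores.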

Keywords:

Kernel (linear algebra)
LU decomposition
Multi-core processor
Parallel computing
Theoretical computer science
Xeon Phi
Multiplication
Linear algebra
Computer science
Coprocessor
Execution time
Auto-tuning
