Look-ahead SLP: auto-vectorization in the presence of commutative operations

Vasileios Porpodas,Rodrigo C. O. Rocha,Luís Fabrício Wanderley Góes

Look-ahead SLP: auto-vectorization in the presence of commutative operations

2018

Auto-vectorizing compilers automatically generate vector (SIMD) instructions out of scalar code. The state-of-the-art algorithm for straight-line code vectorization is Superword-Level Parallelism (SLP). In this work we identify a major limitation at the core of the SLP algorithm, in the performance-critical step of collecting the vectorization candidate instructions that form the SLP-graph data structure. SLP lacks global knowledge when building its vectorization graph, which negatively affects its local decisions when it encounters commutative instructions. We propose LSLP, an improved algorithm that can plug-in to existing SLP implementations, and can effectively vectorize code with arbitrarily long chains of commutative operations. LSLP relies on short-depth look-ahead for better-informed local decisions. Our evaluation on a real machine shows that LSLP can significantly improve the performance of real-world code with little compilation-time overhead.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations