BARRACUDA: binary-level analysis of runtime RAces in CUDA programs

Ariel Eizenberg,Yuanfeng Peng,Toma Pigli,William Mansky,Joseph Devietti

BARRACUDA: binary-level analysis of runtime RAces in CUDA programs

2017

GPU programming models enable and encourage massively parallel programming with over a million threads, requiring extreme parallelism to achieve good performance. Massive parallelism brings significant correctness challenges by increasing the possibility for bugs as the number of thread interleavings balloons. Conventional dynamic safety analyses struggle to run at this scale. We present BARRACUDA, a concurrency bug detector for GPU programs written in Nvidia’s CUDA language. BARRACUDA handles a wider range of parallelism constructs than previous work, including branch operations, low-level atomics and memory fences, which allows BARRACUDA to detect new classes of concurrency bugs. BARRACUDA operates at the binary level for increased compatibility with existing code, leveraging a new binary instrumentation framework that is extensible to other dynamic analyses. BARRACUDA incorporates a number of novel optimizations that are crucial for scaling concurrency bug detection to over a million threads.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations