Work-in-progress: a high-performance FPGA accelerator for sparse neural networks

2017 
Neural networks have been widely used across a broad range of domains, and researchers tune the numbers of layers, neurons, and synapses to fit various applications. As a consequence, neural network models are intensive in both computation and memory. Because of these large memory and computing requirements, it is difficult to deploy neural networks on resource-limited platforms. Sparse neural networks, which prune redundant neurons and synapses, alleviate the computation and memory pressure; however, conventional accelerators cannot benefit from this sparsity. In this paper, we propose a high-performance FPGA accelerator for sparse neural networks that eliminates redundant computations and storage. Our design compresses the sparse weights and operates on the compressed data directly. Experimental results demonstrate that our accelerator reduces the storage of convolutional and fully connected layers by 50% and 10%, respectively, and achieves a 3x speedup over an optimized conventional FPGA accelerator.
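To make the idea of "compressing sparse weights and processing compressed data directly" concrete, the following is a minimal illustrative sketch in C, not the paper's actual hardware design: pruned weights are stored in a CSR-like format (values, column indices, row offsets), and the layer computation walks only the nonzero entries, so zero weights cost neither storage nor multiply operations. The struct layout and function names here are assumptions made for illustration.

#include <stdio.h>

/* Hypothetical CSR-style compressed layout for a pruned weight matrix. */
typedef struct {
    int rows;             /* number of output neurons */
    const float *vals;    /* nonzero weight values */
    const int   *cols;    /* column (input neuron) index of each nonzero */
    const int   *row_ptr; /* rows+1 offsets into vals/cols */
} csr_t;

/* Multiply compressed weights by an activation vector, touching only
   nonzero synapses; zero weights are skipped entirely, which is the
   source of both the storage and the computation savings. */
static void csr_matvec(const csr_t *w, const float *x, float *y) {
    for (int r = 0; r < w->rows; ++r) {
        float acc = 0.0f;
        for (int k = w->row_ptr[r]; k < w->row_ptr[r + 1]; ++k)
            acc += w->vals[k] * x[w->cols[k]];
        y[r] = acc;
    }
}

int main(void) {
    /* Toy 3x4 pruned layer with only 4 of 12 weights remaining. */
    const float vals[] = {2.0f, 1.0f, 3.0f, 4.0f};
    const int   cols[] = {0, 3, 1, 2};
    const int   rowp[] = {0, 2, 3, 4};
    csr_t w = {3, vals, cols, rowp};
    const float x[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    float y[3];
    csr_matvec(&w, x, y);
    for (int r = 0; r < 3; ++r)
        printf("y[%d] = %.1f\n", r, y[r]);
    return 0;
}

In this sketch the inner loop performs 4 multiply-accumulates instead of the 12 a dense implementation would need; a hardware realization would map the same access pattern onto on-chip buffers and parallel processing elements.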