Training Deep Neural Networks in 8-bit Fixed Point with Dynamic Shared Exponent Management

2021 
The increase in complexity and depth of deep neural networks (DNNs) has created a strong need to improve computing performance. Quantization methods for training DNNs can effectively improve the computation throughput and energy efficiency of hardware platforms. We have developed an 8-bit quantization training method that represents the weight, activation, and gradient tensors in an 8-bit fixed-point data format. The shared exponent for each tensor is managed dynamically on the basis of the distribution of the tensor elements observed in the previous training phase rather than the current one, which avoids an extra pass over the data and improves computation throughput. This method provides up to 3.7-times the computation throughput of FP32 computation without accuracy degradation.
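The abstract does not specify the exact exponent-update rule, so the following is only a minimal sketch of the idea: an 8-bit fixed-point tensor with a shared exponent, where the exponent used at the current step is derived from statistics (here, the maximum absolute value) of the tensor at the previous step. The function names (shared_exponent_from_stats, quantize_int8) and the max-based rule are illustrative assumptions, not the authors' implementation.

import numpy as np

def shared_exponent_from_stats(prev_max_abs, mantissa_bits=7):
    # Assumed rule: pick the smallest shared exponent e such that the
    # previously observed maximum magnitude fits in the signed 8-bit range,
    # i.e. prev_max_abs <= (2**mantissa_bits - 1) * 2**e.
    if prev_max_abs == 0.0:
        return 0
    return int(np.ceil(np.log2(prev_max_abs / (2**mantissa_bits - 1))))

def quantize_int8(x, exponent):
    # Quantize tensor x to 8-bit fixed point with one exponent shared
    # by all elements of the tensor.
    scale = 2.0 ** exponent
    q = np.clip(np.round(x / scale), -128, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

# Toy loop: the exponent applied at step t comes from statistics gathered
# at step t-1, so the current tensor can be quantized without first
# scanning it (this is what removes the extra pass mentioned above).
rng = np.random.default_rng(0)
prev_max_abs = 1.0  # initial guess before any statistics exist
for step in range(3):
    w = rng.normal(scale=0.05 * (step + 1), size=(4, 4)).astype(np.float32)
    e = shared_exponent_from_stats(prev_max_abs)
    q, scale = quantize_int8(w, e)
    w_hat = dequantize(q, scale)
    prev_max_abs = float(np.max(np.abs(w)))  # statistics for the next step
    print(f"step {step}: exponent={e}, max quant error={np.max(np.abs(w - w_hat)):.5f}")

In this sketch the shared exponent lags the data by one step; if the tensor's range grows suddenly, some elements saturate at the clip boundary until the exponent catches up at the next step, which is the trade-off implied by using previous-phase statistics.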