Abstract
Efficient deep neural network (DNN) inference on mobile or embedded devices typically involves quantization of the network parameters and activations. In particular, mixed precision networks achieve better performance than networks with homogeneous bitwidth under the same size constraint. Since choosing the optimal bitwidths is not straightforward, training methods that can learn them are desirable. Differentiable quantization with straight-through gradients allows the quantizer’s parameters to be learned with gradient methods. We show that a suitable parametrization of the quantizer is the key to achieving stable training and good final performance. Specifically, we propose to parametrize the quantizer by its step size and dynamic range; the bitwidth can then be inferred from them. Other parametrizations, which explicitly use the bitwidth, consistently perform worse. We confirm our findings with experiments on CIFAR-10 and ImageNet, obtaining mixed precision DNNs with learned quantization parameters that achieve state-of-the-art performance.
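As a rough illustration of how the bitwidth can be inferred from the other two quantities (the concrete quantizer convention and symbols below are assumptions for illustration, not taken from the abstract): for a symmetric, signed uniform quantizer with step size $d$ and dynamic range $q_{\max}$, the largest representable level fixes the bitwidth $b$ via
\[
q_{\max} = d\,\bigl(2^{b-1} - 1\bigr)
\quad\Longrightarrow\quad
b = \log_2\!\left(\frac{q_{\max}}{d} + 1\right) + 1,
\]
so learning $d$ and $q_{\max}$ with gradient methods implicitly learns $b$, which can then be rounded to an integer for deployment.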