Cancel

量化

dazuo Jul 2, 2020 2020-07-02T20:19:00+08:00

1 min

《Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference》

摘要

提出了一种量化方案：允许使用整型进行推理，在常用的只支持整型计算单元的硬件上可以比浮点推理效率更高。

还设计了一种训练方案，使得端到端模型精度和时延之间有更好的权衡。

很广泛的研究：在精度损失较小的前提下，减小模型体积和推理时间。主要分为2类：

This post is licensed under CC BY 4.0 by the author.

Recent Update