Quantization

Back

Reading

  1. https://www.mathworks.com/company/newsletters/articles/what-is-int8-quantization-and-why-is-it-popular-for-deep-neural-networks.html
  2. https://www.tensorflow.org/lite/performance/post_training_integer_quant
  3. https://on-demand.gputechconf.com/gtc/2017/presentation/s7310-8-bit-inference-with-tensorrt.pdf
  4. https://en.wikipedia.org/wiki/Kullback%E2%80%93Leibler_divergence
  5. https://pytorch.org/docs/stable/quantization.html#best-practices