facebookresearch / diffq

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.
235Updated last year

Alternatives and similar repositories for diffq:

Users that are interested in diffq are comparing it to the libraries listed below