AlpinDale / RPTQ-for-LLaMA

Efficient 3bit/4bit quantization of LLaMA models
19Updated last year

Alternatives and similar repositories for RPTQ-for-LLaMA:

Users that are interested in RPTQ-for-LLaMA are comparing it to the libraries listed below