0cc4m / GPTQ-for-LLaMa

4-bit quantization of LLMs using GPTQ
47 · Updated last year

Alternatives and similar repositories for GPTQ-for-LLaMa:

Users who are interested in GPTQ-for-LLaMa are comparing it to the libraries listed below.