sterlind / GPTQ-for-LLaMa

4-bit quantization of LLaMa using GPTQ
11 · Updated last year

Related projects

Alternatives and complementary repositories for GPTQ-for-LLaMa