WapaMario63 / GPTQ-for-LLaMa-ROCm

4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.
32Updated last year

Related projects

Alternatives and complementary repositories for GPTQ-for-LLaMa-ROCm