ModelCloud / GPTQModelLinks

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
β˜†1,007Updated this week

Alternatives and similar repositories for GPTQModel

Users that are interested in GPTQModel are comparing it to the libraries listed below

Sorting: