ModelCloud / GPTQModelLinks

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
778Updated this week

Alternatives and similar repositories for GPTQModel

Users that are interested in GPTQModel are comparing it to the libraries listed below

Sorting: