ModelCloud / GPTQModelView on GitHub
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
1,085Apr 4, 2026Updated this week

Alternatives and similar repositories for GPTQModel

Users that are interested in GPTQModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?