ModelCloud / GPTQModelView on GitHub
LLM model quantization (compression) toolkit with HW acceleration support for Nvidia, AMD, Intel GPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
1,121Apr 24, 2026Updated this week

Alternatives and similar repositories for GPTQModel

Users that are interested in GPTQModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?