intel / auto-roundLinks

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM. Export your models effortlessly to autogptq, autoawq, gguf and autoround formats with high accuracy even at extremely low bit precision.
551Updated this week

Alternatives and similar repositories for auto-round

Users that are interested in auto-round are comparing it to the libraries listed below

Sorting: