intel / auto-roundView on GitHub
🎯An accuracy-first, highly efficient quantization toolkit for LLMs, designed to minimize quality degradation across Weight-Only Quantization, MXFP4, NVFP4, GGUF, and adaptive schemes.
β˜†845Feb 14, 2026Updated 2 weeks ago

Alternatives and similar repositories for auto-round

Users that are interested in auto-round are comparing it to the libraries listed below

Sorting:

Are these results useful?