intel / auto-roundLinks

Advanced quantization toolkit for LLMs and VLMs. Support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration with Transformers, vLLM, SGLang, and llm-compressor
775Updated this week

Alternatives and similar repositories for auto-round

Users that are interested in auto-round are comparing it to the libraries listed below

Sorting: