intel / auto-round

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM.
483 · Updated this week

Alternatives and similar repositories for auto-round

Users who are interested in auto-round are comparing it to the libraries listed below.
