microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
503 · Updated this week

Alternatives and similar repositories for BitBLAS:

Users who are interested in BitBLAS often compare it to the libraries listed below.