microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
594Updated 2 months ago

Alternatives and similar repositories for BitBLAS:

Users that are interested in BitBLAS are comparing it to the libraries listed below