spcl / QuaRotView on GitHub
Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.
483Nov 26, 2024Updated last year

Alternatives and similar repositories for QuaRot

Users that are interested in QuaRot are comparing it to the libraries listed below

Sorting:

Are these results useful?