spcl / QuaRotLinks

Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.
390Updated 6 months ago

Alternatives and similar repositories for QuaRot

Users that are interested in QuaRot are comparing it to the libraries listed below

Sorting: