spcl / QuaRot

Code for the NeurIPS 2024 paper: QuaRot, end-to-end 4-bit inference of large language models.
429 · Updated 10 months ago

Alternatives and similar repositories for QuaRot

Users who are interested in QuaRot are comparing it to the libraries listed below.
