spcl / QuaRot
Code for the NeurIPS 2024 paper: QuaRot, end-to-end 4-bit inference of large language models.
498 · Nov 26, 2024 · Updated last year

Alternatives and similar repositories for QuaRot

Users who are interested in QuaRot are comparing it to the libraries listed below.
