SqueezeBits / QUICK
View external linksLinks

QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference
120Mar 6, 2024Updated last year

Alternatives and similar repositories for QUICK

Users that are interested in QUICK are comparing it to the libraries listed below

Sorting:

Are these results useful?