ModelTC / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
39 · Mar 11, 2024 · Updated last year

Alternatives and similar repositories for QLLM

Users who are interested in QLLM are comparing it to the libraries listed below.
