ziplab / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
31 · Mar 12, 2024 · Updated 2 years ago

Alternatives and similar repositories for QLLM

Users who are interested in QLLM are comparing it to the libraries listed below.
