ModelTC / QLLM

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
35Updated 11 months ago

Alternatives and similar repositories for QLLM:

Users that are interested in QLLM are comparing it to the libraries listed below