Intelligent-Computing-Lab-Yale / GPTAQ
Code implementation of GPTQv2 (https://arxiv.org/abs/2504.02692)
☆45 Updated 2 weeks ago
Alternatives and similar repositories for GPTAQ
Users who are interested in GPTAQ are comparing it to the libraries listed below:
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization ☆136 Updated 2 weeks ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti… ☆46 Updated last year
- Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"