bytedance / AffineQuant
Official implementation of the ICLR 2024 paper AffineQuant
☆28 Updated last year
Alternatives and similar repositories for AffineQuant
Users who are interested in AffineQuant are comparing it to the libraries listed below.
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization ☆166 Updated this week
- [ICML 2025] Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization" ☆192 Updated this week
- Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)