zhangsichengsjtu / AFPQ
AFPQ code implementation
☆20Updated last year
Alternatives and similar repositories for AFPQ:
Users that are interested in AFPQ are comparing it to the libraries listed below
- LLM Inference with Microscaling Format☆22Updated 5 months ago
- The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…☆47Updated 2 years ago
- ☆20Updated 6 months ago
- ☆29Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆46Updated last year
- Code implementation of GPTQv2 (https://arxiv.org/abs/2504.02692)