HuangOwen/Quantization-Variation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HuangOwen/Quantization-Variation)

HuangOwen / Quantization-Variation

[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precision"

☆48

Alternatives and similar repositories for Quantization-Variation

Users that are interested in Quantization-Variation are comparing it to the libraries listed below

Sorting:

nbasyl / OFQ
View on GitHub
The official implementation of the ICML 2023 paper OFQ-ViT
☆39Oct 3, 2023Updated 2 years ago
HuangOwen / QAT-ACS
View on GitHub
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆37Aug 20, 2024Updated last year
PingchengDong / GQA-LUT
View on GitHub
The official implementation of the DAC 2024 paper GQA-LUT
☆20Dec 20, 2024Updated last year
GATECH-EIC / ShiftAddViT
View on GitHub
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30Dec 6, 2023Updated 2 years ago
HuangOwen / RoLoRA
View on GitHub
[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
☆38Sep 24, 2024Updated last year
enyac-group / evol-q
View on GitHub
Quantization in the Jagged Loss Landscape of Vision Transformers
☆13Oct 22, 2023Updated 2 years ago
zhangsichengsjtu / AFPQ
View on GitHub
AFPQ code implementation
☆23Nov 6, 2023Updated 2 years ago
hustvl / PD-Quant
View on GitHub
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆60Mar 23, 2023Updated 2 years ago
IST-DASLab / gemm-fp8
View on GitHub
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆20Jan 24, 2025Updated last year
liuzechun / Nonuniform-to-Uniform-Quantization
View on GitHub
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆138Apr 28, 2022Updated 3 years ago
htqin / BiBench
View on GitHub
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…
☆56Mar 4, 2024Updated last year
zkkli / RepQ-ViT
View on GitHub
[ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
☆140Jan 10, 2024Updated 2 years ago
Intelligent-Computing-Lab-Panda / GPTAQ
View on GitHub
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
☆81Jul 28, 2025Updated 7 months ago
rain-neuromorphics / torchmx
View on GitHub
PyTorch Quantization Framework For OCP MX Datatypes.
☆16May 30, 2025Updated 9 months ago
utkarsh-dmx / project-resq
View on GitHub
☆33Mar 28, 2025Updated 11 months ago
ChenMnZ / INT_vs_FP
View on GitHub
A framework to compare low-bit integer and float-point formats
☆66Feb 6, 2026Updated 3 weeks ago
lightmatter-ai / INT-FP-QSim
View on GitHub
Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.
☆51Jul 10, 2023Updated 2 years ago
megvii-research / FQ-ViT
View on GitHub
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆360Apr 11, 2023Updated 2 years ago
Adamdad / Samesame
View on GitHub
An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…
☆10Dec 18, 2019Updated 6 years ago
huqinghao / PalQuant
View on GitHub
☆12Aug 26, 2022Updated 3 years ago
tinganchen / AlignQ
View on GitHub
[CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation
☆11Jan 6, 2023Updated 3 years ago
mbalesni / deepspeed_llama
View on GitHub
Finetuning LLaMA with DeepSpeed
☆10Apr 14, 2023Updated 2 years ago
Qualcomm-AI-research / oscillations-qat
View on GitHub
☆79Jul 21, 2022Updated 3 years ago
ModelTC / quant_horizon
View on GitHub
☆11Jan 10, 2025Updated last year
wimh966 / outlier_suppression
View on GitHub
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆49Oct 5, 2022Updated 3 years ago
kriskrisliu / NoisyQuant
View on GitHub
An official implement of CVPR 2023 paper - NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
☆26Mar 13, 2024Updated last year
hustzxd / LSQuantization
View on GitHub
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆139Nov 19, 2020Updated 5 years ago
facebookresearch / LLM-QAT
View on GitHub
Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"
☆321Mar 4, 2025Updated 11 months ago
IBM / qattn
View on GitHub
Efficient GPU kernels for mixed-precision Vision Transformers in Triton
☆18Sep 18, 2025Updated 5 months ago
ModelTC / Outlier_Suppression_Plus
View on GitHub
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆50Oct 21, 2023Updated 2 years ago
ustlsh / TransPro
View on GitHub
☆17Mar 14, 2023Updated 2 years ago
OpenBitSys / BitDistiller
View on GitHub
[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.
☆134May 16, 2024Updated last year
lliai / EMQ-series
View on GitHub
[ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
☆28Dec 6, 2023Updated 2 years ago
cjf00000 / StatQuant
View on GitHub
code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"
☆29Oct 31, 2020Updated 5 years ago
1hunters / LIMPQ
View on GitHub
Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"
☆61Mar 19, 2023Updated 2 years ago
GATECH-EIC / ShiftAddNAS
View on GitHub
[ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
☆15May 18, 2022Updated 3 years ago
zqu1992 / ALQ
View on GitHub
☆14Oct 24, 2022Updated 3 years ago
zhutmost / lsq-net
View on GitHub
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆310May 8, 2024Updated last year
xushoukai / GDFQ
View on GitHub
official implementation of Generative Low-bitwidth Data Free Quantization(GDFQ)
☆55Jul 23, 2023Updated 2 years ago