cjf00000 / StatQuantLinks

code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"

☆28

Alternatives and similar repositories for StatQuant

Users that are interested in StatQuant are comparing it to the libraries listed below

Sorting:

papers-submission / structured_transposable_masks
Code for ICML 2021 submission
☆34Updated 4 years ago
ModelTC / Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆46Updated last year
moranshkolnik / RobustQuantization
source code of the paper: Robust Quantization: One Model to Rule Them All
☆40Updated 2 years ago
peiswang / BitSplit
BitSplit Post-trining Quantization
☆50Updated 3 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34Updated 2 years ago
snap-research / F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
☆95Updated 3 years ago
wimh966 / outlier_suppression
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆48Updated 2 years ago
charbel-sakr / Fixed-Point-Training
Code needed to reproduce results from my ICLR 2019 paper on fixed-point quantization of the backprop algorithm.
☆10Updated 6 years ago
jun-fang / PWLQ
Code for our paper at ECCV 2020: Post-Training Piecewise Linear Quantization for Deep Neural Networks
☆69Updated 3 years ago
cornell-zhang / dnn-quant-ocs
DNN quantization with outlier channel splitting
☆113Updated 5 years ago
Qualcomm-AI-research / oscillations-qat
☆76Updated 3 years ago
allenbai01 / ProxQuant
ProxQuant: Quantized Neural Networks via Proximal Operators
☆29Updated 6 years ago
papers-submission / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆36Updated 2 years ago
ModelTC / quant_horizon
☆11Updated 6 months ago
zhaoweicai / EdMIPS
PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf
☆59Updated 5 years ago
Zhen-Dong / BitPack
BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.
☆57Updated 2 years ago
SHI-Labs / Any-Precision-DNNs
Any-Precision Deep Neural Networks (AAAI 2021)
☆61Updated 5 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆99Updated 4 years ago
yaozhewei / HAP
☆43Updated last year
csyhhu / MetaQuant
Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019
☆55Updated 5 years ago
ModelTC / mqbench-paper
☆44Updated 4 years ago
jakc4103 / scale-adjusted-training
PyTorch implementation of Towards Efficient Training for Neural Network Quantization
☆15Updated 5 years ago
rhhc / ZeroQ-MP
[CVPR'20] ZeroQ Mixed-Precision implementation (unofficial): A Novel Zero Shot Quantization Framework
☆14Updated 4 years ago
jmluu / Awesome-Efficient-Training
A collection of research papers on efficient training of DNNs
☆70Updated 3 years ago
ynahshan / nn-quantization-pytorch
☆57Updated 4 years ago
aojunzz / NM-sparsity
☆236Updated 2 years ago
nbasyl / OFQ
The official implementation of the ICML 2023 paper OFQ-ViT
☆33Updated last year
hustzxd / LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆136Updated 4 years ago
Intelligent-Computing-Lab-Panda / GPTAQ
Code implementation of GPTAQ (https://arxiv.org/abs/2504.02692)
☆55Updated last week
Qualcomm-AI-research / FP8-quantization
☆154Updated 2 years ago