sophgo / sophgo-mqLinks

Model Quantization Benchmark

☆17

Alternatives and similar repositories for sophgo-mq

Users that are interested in sophgo-mq are comparing it to the libraries listed below

Sorting:

ModelTC / mqbench-paper
☆44Updated 4 years ago
ModelTC / LPCV2021_Winner_Solution
☆28Updated 3 years ago
peiswang / BitSplit
BitSplit Post-trining Quantization
☆50Updated 3 years ago
bytedance / MRECG
☆36Updated 2 years ago
GATECH-EIC / DepthShrinker
[ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …
☆35Updated 3 years ago
ModelTC / quant_horizon
☆11Updated 8 months ago
LaVieEnRoseSMZ / AutoBNN
☆47Updated 5 years ago
BillAmihom / RAPQ
Pytorch implementation of RAPQ, IJCAI 2022
☆23Updated 2 years ago
moranshkolnik / RobustQuantization
source code of the paper: Robust Quantization: One Model to Rule Them All
☆40Updated 2 years ago
DeadAt0m / LSQFakeQuantize-PyTorch
FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch
☆35Updated 3 years ago
MegEngine / examples
A set of examples around MegEngine
☆31Updated last year
ynahshan / nn-quantization-pytorch
☆57Updated 4 years ago
snap-research / F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
☆95Updated 3 years ago
aim-uofa / model-quantization
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆44Updated 4 years ago
LaVieEnRoseSMZ / OQA
☆28Updated 4 years ago
PannenetsF / TQT
TQT's pytorch implementation.
☆21Updated 3 years ago
ChenShisen / ncnnqat
quantize aware training package for NCNN on pytorch
☆69Updated 4 years ago
Ironteen / Batch-Normalization-fusion
Batch Normalization Auto-fusion for PyTorch
☆32Updated 5 years ago
papers-submission / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆36Updated 2 years ago
hustzxd / EfficientPyTorch
A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
☆37Updated 3 years ago
mostafaelhoushi / DeepShift
Implementation of "DeepShift: Towards Multiplication-Less Neural Networks" https://arxiv.org/abs/1905.13298
☆112Updated 3 years ago
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆34Updated last year
lmbxmu / CLR-RNF
Pytorch implementation of our paper (TNNLS) -- Pruning Networks with Cross-Layer Ranking & k-Reciprocal Nearest Filters
☆12Updated 3 years ago
Adamdad / Samesame
An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…
☆10Updated 5 years ago
Qualcomm-AI-research / oscillations-qat
☆76Updated 3 years ago
cap-lab / S3NAS
Fast NPU-aware Neural Architecture Search
☆22Updated 4 years ago
OAID / TengineInferPipe
☆24Updated 2 years ago
sony-si / ai-research
☆48Updated 5 years ago
lmbxmu / 1xN
Pytorch implementation of TPAMI 2022 -- 1xN Pattern for Pruning Convolutional Neural Networks
☆42Updated 3 years ago
HuangOwen / Quantization-Variation
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆46Updated last year