deepglint / EasyQuantLinks

EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activations.

☆402

Alternatives and similar repositories for EasyQuant

Users that are interested in EasyQuant are comparing it to the libraries listed below

Sorting:

aovoc / nnieqat-pytorch
A nnie quantization aware training tool on pytorch.
☆239Updated 4 years ago
inisis / brocolli
Everything in Torch Fx
☆344Updated last year
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆262Updated last year
ModelTC / MQBench
Model Quantization Benchmark
☆826Updated 3 months ago
BUG1989 / caffe-int8-convert-tools
Generate a quantization parameter file for ncnn framework int8 inference
☆518Updated 5 years ago
AI-performance / embedded-ai.bench
benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.
☆204Updated 4 years ago
ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆131Updated last year
Jermmy / pytorch-quantization-demo
A simple network quantization demo using pytorch from scratch.
☆536Updated 2 years ago
hey-yahei / Quantization.MXNet
Simulate quantization and quantization aware training for MXNet-Gluon models.
☆46Updated 5 years ago
BBuf / onnx2X
ONNX2Pytorch
☆162Updated 4 years ago
htshinichi / caffe-onnx
caffe model convert to onnx model
☆176Updated 2 years ago
MTLab / onnx2caffe
pytorch to caffe by onnx
☆375Updated 5 years ago
CoCoPIE-Pruning / CoCoPIE-ModelZoo
☆125Updated 4 years ago
bindog / onnx-surgery
☆81Updated 4 years ago
A-suozhang / awesome-quantization-and-fixed-point-training
Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design
☆161Updated 4 years ago
THU-MIG / torch-model-compression
针对pytorch模型的自动化模型结构分析和修改工具集，包含自动分析模型结构的模型压缩算法库
☆250Updated 2 years ago
ArtyZe / yolo_quantization
Based of paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
☆64Updated 4 years ago
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83Updated 2 years ago
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆446Updated last month
lswzjuer / pytorch-quantity
An 8bit automated quantization conversion tool for the pytorch (Post-training quantization based on KL divergence)
☆33Updated 5 years ago
mit-han-lab / amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆443Updated last year
mit-han-lab / haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
☆392Updated 4 years ago
rmccorm4 / tensorrt-utils
⚡ Useful scripts when using TensorRT
☆242Updated 4 years ago
Ironteen / YOLOv3-quantization-model-v1.0
YOLOv3 quantization model v10, only for quantization off-line
☆21Updated 6 years ago
apxlwl / MobileNet-v2-pruning
Try out different pruning-approaches on lightweight Backbones.
☆146Updated 5 years ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆282Updated 4 years ago
FrozenGene / tvm-tutorial
TVM tutorial
☆66Updated 6 years ago
xxradon / ONNXToCaffe
pytorch -> onnx -> caffe, pytorch to caffe, or other deep learning framework to onnx and onnx to caffe.
☆163Updated 4 years ago
TrojanXu / onnxparser-trt-plugin-sample
A sample for onnxparser working with trt user defined plugins for TRT7.0
☆168Updated 4 years ago
grimoire / amirstan_plugin
Useful tensorrt plugin. For pytorch and mmdetection model conversion.
☆165Updated 9 months ago