analogdevicesinc / distillerLinks

Fork of Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller

☆15

Alternatives and similar repositories for distiller

Users that are interested in distiller are comparing it to the libraries listed below

Sorting:

leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆136Updated last year
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆82Updated 2 years ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆93Updated 8 months ago
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆138Updated 2 years ago
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆262Updated last year
scailable / sclblonnx
Scailable ONNX python tools
☆97Updated 8 months ago
eliberis / tflite-tools
TFLite model analyzer & memory optimizer
☆127Updated last year
marcoancona / TorchPruner
On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…
☆164Updated 5 years ago
juliagusak / model-compression-and-acceleration-progress
Repository to track the progress in model compression and acceleration
☆105Updated 4 years ago
leimao / PyTorch-Static-Quantization
PyTorch Static Quantization Example
☆38Updated 4 years ago
mrusci / training-mixed-precision-quantized-networks
This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…
☆50Updated last year
EunhyeokPark / PROFIT
☆47Updated 3 years ago
DeadAt0m / LSQFakeQuantize-PyTorch
FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch
☆34Updated 3 years ago
mostafaelhoushi / DeepShift
Implementation of "DeepShift: Towards Multiplication-Less Neural Networks" https://arxiv.org/abs/1905.13298
☆112Updated 3 years ago
ivclab / Multistage_Pruning
Cheng-Hao Tu, Jia-Hong Lee, Yi-Ming Chan and Chu-Song Chen, "Pruning Depthwise Separable Convolutions for MobileNet Compression," Interna…
☆17Updated 4 years ago
apxlwl / MobileNet-v2-pruning
Try out different pruning-approaches on lightweight Backbones.
☆145Updated 5 years ago
sony-si / ai-research
☆47Updated 4 years ago
Deelvin / apache-tvm-tutorials
☆10Updated last year
Jzz24 / pytorch_quantization
A pytorch implementation of dorefa quantization
☆113Updated 5 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆97Updated 4 years ago
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆296Updated last year
sithu31296 / PyTorch-ONNX-TFLite
Conversion of PyTorch Models into TFLite
☆382Updated 2 years ago
pbridger / tensorrt-ssd300-8bit-quantized
☆52Updated 4 years ago
snap-research / F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
☆95Updated 3 years ago
yhhhli / APoT_Quantization
PyTorch implementation for the APoT quantization (ICLR 2020)
☆275Updated 6 months ago
mit-han-lab / apq
[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
☆158Updated 5 years ago
qinjian623 / pytorch_toys
Personal Pytorch toy script.
☆67Updated 3 years ago
Ironteen / Batch-Normalization-fusion
Batch Normalization Auto-fusion for PyTorch
☆32Updated 5 years ago
kentaroy47 / benchmark-FP32-FP16-INT8-with-TensorRT
Benchmark inference speed of CNNs with various quantization methods in Pytorch+TensorRT with Jetson Nano/Xavier
☆56Updated 2 years ago
zhenhuaw-me / tflite
Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/
☆101Updated 4 months ago