analogdevicesinc / distiller
Fork of Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller
☆14Updated 8 months ago
Alternatives and similar repositories for distiller:
Users that are interested in distiller are comparing it to the libraries listed below
- PyTorch Quantization Aware Training Example☆132Updated 10 months ago
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆89Updated 5 months ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆262Updated last year
- TFLite model analyzer & memory optimizer☆124Updated last year
- PyTorch Static Quantization Example☆38Updated 3 years ago
- ☆225Updated 3 years ago
- ☆316Updated last year
- On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…☆164Updated 4 years ago
- Try out different pruning-approaches on lightweight Backbones.☆146Updated 4 years ago
- Scailable ONNX python tools☆97Updated 5 months ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆277Updated last year
- A code generator from ONNX to PyTorch code☆135Updated 2 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆98Updated 2 months ago
- ☆127Updated 4 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆33Updated 3 years ago
- Implementation of "DeepShift: Towards Multiplication-Less Neural Networks" https://arxiv.org/abs/1905.13298☆111Updated 3 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy☆157Updated 4 years ago
- Repository to track the progress in model compression and acceleration☆105Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆96Updated 3 years ago
- Repository containing pruned models and related information☆37Updated 4 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆395Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆422Updated 2 months ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆429Updated last year
- Pytorch implementation of BRECQ, ICLR 2021☆270Updated 3 years ago
- Roughly calculate FLOPs of a tflite model☆38Updated 3 years ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆291Updated 11 months ago
- A nnie quantization aware training tool on pytorch.☆239Updated 4 years ago
- quantize aware training package for NCNN on pytorch☆70Updated 3 years ago
- A pytorch implementation of dorefa quantization☆113Updated 5 years ago