analogdevicesinc / distiller
Fork of Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://nervanasystems.github.io/distiller
☆14Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for distiller
- PyTorch Quantization Aware Training Example☆123Updated 5 months ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆88Updated 2 weeks ago
- Inference of quantization aware trained networks using TensorRT☆79Updated last year
- Scailable ONNX python tools☆96Updated 2 weeks ago
- A code generator from ONNX to PyTorch code☆132Updated last year
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆258Updated last year
- PyTorch Pruning Example☆46Updated last year
- A parser, editor and profiler tool for ONNX models.☆398Updated last month
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆277Updated 6 months ago
- ☆52Updated 3 years ago
- PyTorch Static Quantization Example☆39Updated 3 years ago
- ☆298Updated 11 months ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆32Updated 2 years ago
- Roughly calculate FLOPs of a tflite model☆36Updated 3 years ago
- TFLite model analyzer & memory optimizer☆120Updated 9 months ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆392Updated last year
- quantize aware training package for NCNN on pytorch☆68Updated 3 years ago
- Batch Normalization Auto-fusion for PyTorch☆32Updated 4 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆95Updated 2 years ago
- On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…☆160Updated 4 years ago
- Utility scripts for editing or modifying onnx models. Utility scripts to summarize onnx model files along with visualization for loop ope…☆80Updated 3 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆413Updated last year
- Script to typecast ONNX model parameters from INT64 to INT32.☆97Updated 6 months ago
- This repository containts the pytorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆49Updated 6 months ago
- ☆21Updated 2 years ago
- A package to make do Network Slimming a little easier☆47Updated 2 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆324Updated this week
- PyTorch implementation for the APoT quantization (ICLR 2020)☆267Updated 2 years ago
- Parallel CUDA implementation of NON maximum Suppression☆77Updated 4 years ago
- ☆25Updated 2 years ago