quic / aimet-model-zooLinks
☆328Updated last year
Alternatives and similar repositories for aimet-model-zoo
Users that are interested in aimet-model-zoo are comparing it to the libraries listed below
Sorting:
- A parser, editor and profiler tool for ONNX models.☆442Updated 2 weeks ago
- ☆204Updated 3 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,340Updated this week
- PyTorch Quantization Aware Training Example☆136Updated last year
- Inference of quantization aware trained networks using TensorRT☆82Updated 2 years ago
- A code generator from ONNX to PyTorch code☆138Updated 2 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆262Updated last year
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆833Updated last month
- A simple network quantization demo using pytorch from scratch.☆533Updated 2 years ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆400Updated this week
- Model Quantization Benchmark☆816Updated 2 months ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆296Updated last year
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,046Updated last week
- ☆143Updated 3 months ago
- ONNX Optimizer☆723Updated last week
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆439Updated 2 years ago
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆34Updated 3 years ago
- ☆149Updated 2 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆95Updated 3 years ago
- Common utilities for ONNX converters☆272Updated 6 months ago
- On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…☆164Updated 5 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆280Updated last year
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆354Updated 10 months ago
- Pytorch implementation of BRECQ, ICLR 2021☆276Updated 3 years ago
- Conversion of PyTorch Models into TFLite☆382Updated 2 years ago
- Offline Quantization Tools for Deploy.☆129Updated last year
- PyTorch implementation for the APoT quantization (ICLR 2020)☆275Updated 6 months ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆400Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆93Updated 8 months ago