quic / aimet-model-zooLinks
☆340Updated 2 years ago
Alternatives and similar repositories for aimet-model-zoo
Users that are interested in aimet-model-zoo are comparing it to the libraries listed below
Sorting:
- A parser, editor and profiler tool for ONNX models.☆471Updated 2 months ago
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆428Updated last week
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆862Updated 2 weeks ago
- ☆173Updated 2 weeks ago
- PyTorch Quantization Aware Training Example☆149Updated last year
- Conversion of PyTorch Models into TFLite☆398Updated 2 years ago
- A simple network quantization demo using pytorch from scratch.☆541Updated 2 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆264Updated 2 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆37Updated 4 years ago
- Model Quantization Benchmark☆855Updated 8 months ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆303Updated last year
- Transform ONNX model to PyTorch representation☆344Updated 2 months ago
- Offline Quantization Tools for Deploy.☆141Updated 2 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,530Updated this week
- A code generator from ONNX to PyTorch code☆142Updated 3 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆452Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆95Updated last year
- Pytorch implementation of BRECQ, ICLR 2021☆287Updated 4 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆311Updated last year
- ☆208Updated 4 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆360Updated last year
- benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 4 years ago
- ONNX Optimizer☆787Updated this week
- Roughly calculate FLOPs of a tflite model☆39Updated 4 years ago
- Common utilities for ONNX converters☆290Updated 3 weeks ago
- TFLite model analyzer & memory optimizer☆135Updated last year
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆279Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆356Updated 2 years ago
- Acuity Model Zoo☆150Updated 3 months ago