quic / aimet-model-zoo
☆332Updated last year
Alternatives and similar repositories for aimet-model-zoo
Users who are interested in aimet-model-zoo are comparing it to the libraries listed below:
- A parser, editor and profiler tool for ONNX models.☆451Updated 3 weeks ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆842Updated last week
- Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…☆412Updated last month
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,432Updated last week
- ☆154Updated 2 months ago
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆297Updated last year
- PyTorch Quantization Aware Training Example☆140Updated last year
- Model Quantization Benchmark☆829Updated 4 months ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch☆34Updated 3 years ago
- ☆206Updated 3 years ago
- A code generator from ONNX to PyTorch code☆139Updated 2 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆262Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models.☆94Updated 10 months ago
- Transform ONNX model to PyTorch representation☆338Updated 9 months ago
- A simple network quantization demo using PyTorch from scratch.☆534Updated 2 years ago
- Conversion of PyTorch Models into TFLite☆389Updated 2 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆357Updated last year
- Common utilities for ONNX converters☆277Updated this week
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆442Updated 2 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆309Updated 11 months ago
- Offline Quantization Tools for Deploy.☆136Updated last year
- ONNX Optimizer☆745Updated 3 weeks ago
- Benchmark for embedded-AI deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite, etc.☆204Updated 4 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆594Updated last year
- PyTorch implementation of BRECQ, ICLR 2021☆282Updated 4 years ago
- TensorRT 2022 semifinal solution: TensorRT inference optimization of MST++, the first Transformer-based image reconstruction model☆141Updated 3 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆402Updated 2 years ago
- ☆158Updated 2 years ago
- VeriSilicon Tensor Interface Module☆237Updated 7 months ago