sony / model_optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆395Updated this week
Alternatives and similar repositories for model_optimization
Users that are interested in model_optimization are comparing it to the libraries listed below
Sorting:
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆292Updated last year
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,009Updated this week
- This script converts the ONNX/OpenVINO IR model to Tensorflow's saved_model, tflite, h5, tfjs, tftrt(TensorRT), CoreML, EdgeTPU, ONNX and…☆341Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆431Updated 4 months ago
- TFLite model analyzer & memory optimizer☆126Updated last year
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆432Updated 2 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆565Updated last year
- PyTorch Quantization Aware Training Example☆135Updated 11 months ago
- ☆146Updated 2 years ago
- ☆323Updated last year
- Supporting PyTorch models with the Google AI Edge TFLite runtime.☆574Updated this week
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.☆195Updated 11 months ago
- A library for researching neural networks compression and acceleration methods.☆139Updated 8 months ago
- Transform ONNX model to PyTorch representation☆333Updated 6 months ago
- Conversion of PyTorch Models into TFLite☆375Updated 2 years ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆793Updated this week
- Pytorch to Keras/Tensorflow/TFLite conversion made intuitive☆308Updated 2 months ago
- Common utilities for ONNX converters☆269Updated 5 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆107Updated 3 weeks ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆303Updated 8 months ago
- Pytorch implementation of BRECQ, ICLR 2021☆272Updated 3 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆79Updated 4 years ago
- Model compression for ONNX☆92Updated 5 months ago
- A code generator from ONNX to PyTorch code☆136Updated 2 years ago
- Model Quantization Benchmark☆803Updated 3 weeks ago
- ONNX Optimizer☆707Updated 2 weeks ago
- ☆204Updated 3 years ago
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- Quantization of Convolutional Neural networks.☆244Updated 9 months ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆148Updated this week