sony / model_optimization
Model Compression Toolkit (MCT) is an open-source project for optimizing neural network models for deployment on efficient, constrained hardware. It provides researchers, developers, and engineers with advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆328 Updated this week
Related projects
Alternatives and complementary repositories for model_optimization
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op… ☆277 Updated 6 months ago
- A parser, editor and profiler tool for ONNX models. ☆400 Updated this week
- ☆302 Updated 11 months ago
- ☆122 Updated last year
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM ☆293 Updated 2 months ago
- Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆413 Updated last year
- Inference of quantization aware trained networks using TensorRT ☆79 Updated last year
- Common utilities for ONNX converters ☆251 Updated 5 months ago
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillati… ☆567 Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆943 Updated this week
- A code generator from ONNX to PyTorch code ☆133 Updated 2 years ago
- Count number of parameters / MACs / FLOPS for ONNX models. ☆89 Updated 3 weeks ago
- Actively maintained ONNX Optimizer ☆647 Updated 8 months ago
- ☆214 Updated 2 years ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L… ☆480 Updated 7 months ago
- Offline quantization tools for deployment. ☆116 Updated 10 months ago
- This script converts the ONNX/OpenVINO IR model to TensorFlow's saved_model, tflite, h5, tfjs, tftrt(TensorRT), CoreML, EdgeTPU, ONNX and… ☆339 Updated 2 years ago
- Supporting PyTorch models with the Google AI Edge TFLite runtime. ☆371 Updated this week
- ☆195 Updated 3 years ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction. ☆258 Updated last year
- PyTorch implementation of BRECQ, ICLR 2021 ☆253 Updated 3 years ago
- A library for researching neural networks compression and acceleration methods. ☆136 Updated 2 months ago
- Transform ONNX model to PyTorch representation ☆318 Updated last week
- PyTorch Quantization Aware Training Example ☆122 Updated 6 months ago
- PyTorch implementation for the APoT quantization (ICLR 2020) ☆268 Updated 2 years ago
- List of papers related to neural network quantization in recent AI conferences and journals. ☆458 Updated last month
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime ☆338 Updated this week
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆82 Updated 2 weeks ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆127 Updated 3 weeks ago
- Quantization of convolutional neural networks. ☆238 Updated 3 months ago