sony / model_optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.
☆348Updated this week
Alternatives and similar repositories for model_optimization:
Users that are interested in model_optimization are comparing it to the libraries listed below
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…☆283Updated 9 months ago
- A parser, editor and profiler tool for ONNX models.☆414Updated 2 weeks ago
- This script converts the ONNX/OpenVINO IR model to Tensorflow's saved_model, tflite, h5, tfjs, tftrt(TensorRT), CoreML, EdgeTPU, ONNX and…☆341Updated 2 years ago
- Supporting PyTorch models with the Google AI Edge TFLite runtime.☆428Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆968Updated this week
- TFLite model analyzer & memory optimizer☆121Updated last year
- Count number of parameters / MACs / FLOPS for ONNX models.☆90Updated 3 months ago
- Conversion of PyTorch Models into TFLite☆365Updated last year
- PyTorch Quantization Aware Training Example☆127Updated 8 months ago
- ☆132Updated last year
- TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillati…☆679Updated 3 weeks ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆422Updated last year
- A code generator from ONNX to PyTorch code☆135Updated 2 years ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆742Updated last week
- Transform ONNX model to PyTorch representation☆324Updated 2 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆312Updated this week
- ☆312Updated last year
- Inference of quantization aware trained networks using TensorRT☆80Updated 2 years ago
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆296Updated 4 months ago
- ☆197Updated 3 years ago
- Common utilities for ONNX converters☆257Updated last month
- Actively maintained ONNX Optimizer☆662Updated this week
- On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…☆162Updated 4 years ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆352Updated this week
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆505Updated 10 months ago
- Scailable ONNX python tools☆96Updated 3 months ago
- This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".☆745Updated 2 years ago
- Pytorch implementation of BRECQ, ICLR 2021☆261Updated 3 years ago
- A library for researching neural networks compression and acceleration methods.☆139Updated 5 months ago
- PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.☆260Updated last year