sony / model_optimizationLinks

Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. This project provides researchers, developers, and engineers advanced quantization and compression tools for deploying state-of-the-art neural networks.

☆400

Alternatives and similar repositories for model_optimization

Users that are interested in model_optimization are comparing it to the libraries listed below

Sorting:

PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆296Updated last year
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆442Updated 2 weeks ago
quic / aimet-model-zoo
☆328Updated last year
PINTO0309 / openvino2tensorflow
This script converts the ONNX/OpenVINO IR model to Tensorflow's saved_model, tflite, h5, tfjs, tftrt(TensorRT), CoreML, EdgeTPU, ONNX and…
☆342Updated 2 years ago
levipereira / yolov9-qat
Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.
☆113Updated 2 months ago
eliberis / tflite-tools
TFLite model analyzer & memory optimizer
☆127Updated last year
Qualcomm-AI-research / FP8-quantization
☆149Updated 2 years ago
onnx / optimizer
ONNX Optimizer
☆723Updated last week
openvinotoolkit / nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
☆1,051Updated this week
SonySemiconductorSolutions / mct-quantization-layers
☆21Updated last month
sithu31296 / PyTorch-ONNX-TFLite
Conversion of PyTorch Models into TFLite
☆382Updated 2 years ago
Qualcomm-AI-research / transformer-quantization
☆205Updated 3 years ago
PINTO0309 / onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…
☆814Updated last week
AlexanderLutsenko / nobuco
Pytorch to Keras/Tensorflow/TFLite conversion made intuitive
☆313Updated 3 months ago
Talmaj / onnx2pytorch
Transform ONNX model to PyTorch representation
☆338Updated 7 months ago
mit-han-lab / mcunet
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…
☆579Updated last year
ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆129Updated last year
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆82Updated 2 years ago
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆136Updated last year
NVIDIA-AI-IOT / jetson_dla_tutorial
A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson
☆332Updated 3 years ago
NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆200Updated last year
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆272Updated 6 months ago
Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆441Updated 2 years ago
iwatake2222 / InferenceHelper
C++ Helper Class for Deep Learning Inference Frameworks: TensorFlow Lite, TensorRT, OpenCV, OpenVINO, ncnn, MNN, SNPE, Arm NN, NNabla, ON…
☆289Updated 3 years ago
yhhhli / BRECQ
Pytorch implementation of BRECQ, ICLR 2021
☆276Updated 3 years ago
fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆150Updated this week
jakc4103 / DFQ
PyTorch implementation of Data Free Quantization Through Weight Equalization and Bias Correction.
☆262Updated last year
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆93Updated 8 months ago
eliberis / uNAS
μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.
☆80Updated 4 years ago
onnx / neural-compressor
Model compression for ONNX
☆96Updated 7 months ago