SonySemiconductorSolutions / mct-model-optimization

Model Compression Toolkit (MCT) is an open source project for optimizing neural network models to run on efficient, constrained hardware. It provides researchers, developers, and engineers with advanced quantization and compression tools for deploying state-of-the-art neural networks.

☆419

Alternatives and similar repositories for mct-model-optimization

Users who are interested in mct-model-optimization compare it to the libraries listed below.

PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆298 Updated last year
quic / aimet-model-zoo
☆337 Updated last year
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆460 Updated 2 months ago
PINTO0309 / onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…
☆867 Updated 3 months ago
eliberis / tflite-tools
TFLite model analyzer & memory optimizer
☆132 Updated last year
leimao / PyTorch-Quantization-Aware-Training
PyTorch Quantization Aware Training Example
☆143 Updated last year
levipereira / yolov9-qat
Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.
☆128 Updated 6 months ago
sithu31296 / PyTorch-ONNX-TFLite
Conversion of PyTorch Models into TFLite
☆392 Updated 2 years ago
PINTO0309 / openvino2tensorflow
This script converts the ONNX/OpenVINO IR model to Tensorflow's saved_model, tflite, h5, tfjs, tftrt(TensorRT), CoreML, EdgeTPU, ONNX and…
☆343 Updated 3 years ago
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83 Updated 2 years ago
NVIDIA / Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
☆218 Updated last year
fastmachinelearning / qonnx
QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX
☆161 Updated this week
mit-han-lab / mcunet
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…
☆609 Updated last year
SonySemiconductorSolutions / mct-quantization-layers
☆23 Updated last week
AlexanderLutsenko / nobuco
Pytorch to Keras/Tensorflow/TFLite conversion made intuitive
☆328 Updated 7 months ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆94 Updated last year
Qualcomm-AI-research / transformer-quantization
☆205 Updated 3 years ago
DeadAt0m / LSQFakeQuantize-PyTorch
FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch
☆36 Updated 3 years ago
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆281 Updated last month
openvinotoolkit / nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
☆1,091 Updated this week
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆141 Updated 2 years ago
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆404 Updated this week
PINTO0309 / tflite2json2tflite
Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.
☆27 Updated 2 years ago
tianyic / only_train_once_personal_footprint
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆309 Updated last year
Talmaj / onnx2pytorch
Transform ONNX model to PyTorch representation
☆341 Updated 11 months ago
ENOT-AutoDL / onnx2torch
Convert ONNX models to PyTorch.
☆705 Updated last week
IBM / qattn
Efficient GPU kernels for mixed-precision Vision Transformers in Triton
☆15 Updated last month
mit-han-lab / tinyengine
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…
☆901 Updated 11 months ago
quic / qidk
☆164 Updated 4 months ago
Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆451 Updated 2 years ago