openvinotoolkit / nncfLinks

Neural Network Compression Framework for enhanced OpenVINO™ inference

☆1,066

Alternatives and similar repositories for nncf

Users that are interested in nncf are comparing it to the libraries listed below

Sorting:

onnx / optimizer
ONNX Optimizer
☆735Updated 2 weeks ago
intel / neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…
☆2,461Updated last week
ENOT-AutoDL / onnx2torch
Convert ONNX models to PyTorch.
☆691Updated 11 months ago
quic / aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
☆2,383Updated this week
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆446Updated last month
NVIDIA / TensorRT-Model-Optimizer
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. …
☆1,078Updated 2 weeks ago
ZhangGe6 / onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
☆1,536Updated 5 months ago
Talmaj / onnx2pytorch
Transform ONNX model to PyTorch representation
☆338Updated 8 months ago
SonySemiconductorSolutions / mct-model-optimization
Model Compression Toolkit (MCT) is an open source project for neural network model optimization under efficient, constrained hardware. Th…
☆407Updated 3 weeks ago
huggingface / optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
☆481Updated this week
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆275Updated 2 weeks ago
ModelTC / MQBench
Model Quantization Benchmark
☆826Updated 3 months ago
openvinotoolkit / model_server
A scalable inference server for models optimized with OpenVINO™
☆745Updated this week
alibaba / TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
☆838Updated 2 months ago
pytorch / TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
☆2,818Updated this week
openvinotoolkit / openvino_xai
OpenVINO™ Explainable AI (XAI) Toolkit: Visual Explanation for OpenVINO Models
☆32Updated 4 months ago
intel / ai-reference-models
Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…
☆717Updated last week
Tencent / TPAT
TensorRT Plugin Autogen Tool
☆369Updated 2 years ago
quic / aimet-model-zoo
☆332Updated last year
microsoft / nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
☆356Updated last year
microsoft / onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime
☆404Updated this week
onnx / onnx-tensorflow
Tensorflow Backend for ONNX
☆1,312Updated last year
onnx / onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
☆3,124Updated last week
zerollzeng / tiny-tensorrt
Deploy your model with TensorRT quickly.
☆768Updated last year
Zhen-Dong / HAWQ
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
☆443Updated 2 years ago
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆369Updated this week
daquexian / onnx-simplifier
Simplify your onnx model
☆4,121Updated 10 months ago
NVIDIA / sampleQAT
Inference of quantization aware trained networks using TensorRT
☆83Updated 2 years ago
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆296Updated last year
rmccorm4 / tensorrt-utils
⚡ Useful scripts when using TensorRT
☆242Updated 4 years ago