microsoft / onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

☆1,516

Alternatives and similar repositories for onnxruntime-inference-examples

Users who are interested in onnxruntime-inference-examples compare it to the repositories listed below.

ZhangGe6 / onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
☆1,568 Updated 8 months ago
daquexian / onnx-simplifier
Simplify your onnx model
☆4,217 Updated 2 months ago
microsoft / onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime
☆418 Updated last week
onnx / optimizer
ONNX Optimizer
☆768 Updated last week
onnx / tensorflow-onnx
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
☆2,484 Updated last month
onnx / onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
☆3,162 Updated last month
pnnx / pnnx
PyTorch Neural Network eXchange
☆638 Updated last week
PINTO0309 / onnx2tf
Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…
☆869 Updated this week
onnx / onnxmltools
ONNXMLTools enables conversion of models to ONNX
☆1,119 Updated 4 months ago
pytorch / TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
☆2,874 Updated last week
microsoft / onnxruntime-genai
Generative AI extensions for onnxruntime
☆861 Updated last week
cyrusbehr / tensorrt-cpp-api
TensorRT C++ API Tutorial
☆771 Updated 11 months ago
leimao / ONNX-Runtime-Inference
ONNX Runtime Inference C++ Example
☆249 Updated 6 months ago
xmba15 / onnx_runtime_cpp
Small C++ library to quickly deploy models using onnxruntime
☆384 Updated last year
microsoft / onnxruntime-training-examples
Examples for using ONNX Runtime for model training.
☆351 Updated last year
NVIDIA / trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
☆1,644 Updated 5 months ago
intel / neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…
☆2,517 Updated this week
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆283 Updated last month
CVCUDA / CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
☆2,593 Updated 5 months ago
quic / ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.)…
☆815 Updated 2 weeks ago
open-mmlab / mmdeploy
OpenMMLab Model Deployment Framework
☆3,059 Updated last year
ENOT-AutoDL / onnx2torch
Convert ONNX models to PyTorch.
☆705 Updated 2 weeks ago
triton-inference-server / client
Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala.
☆654 Updated last week
PaddlePaddle / Paddle2ONNX
ONNX Model Exporter for PaddlePaddle
☆860 Updated 3 months ago
microsoft / Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
☆2,163 Updated this week
google / XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
☆2,144 Updated this week
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆460 Updated 2 months ago
google-ai-edge / ai-edge-torch
Supporting PyTorch models with the Google AI Edge TFLite runtime.
☆811 Updated this week
triple-Mu / YOLOv8-TensorRT
YOLOv8 accelerated with TensorRT.
☆1,687 Updated 6 months ago
quic / aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
☆2,471 Updated last week