microsoft / onnxruntime-extensionsLinks

onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime

☆418

Alternatives and similar repositories for onnxruntime-extensions

Users that are interested in onnxruntime-extensions are comparing it to the libraries listed below

Sorting:

microsoft / onnxconverter-common
Common utilities for ONNX converters
☆281Updated last month
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆404Updated this week
onnx / optimizer
ONNX Optimizer
☆764Updated 3 weeks ago
microsoft / onnxruntime-training-examples
Examples for using ONNX Runtime for model training.
☆351Updated last year
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆162Updated 2 weeks ago
mlc-ai / tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
☆400Updated 2 months ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆495Updated this week
inisis / OnnxSlim
A Toolkit to Help Optimize Onnx Model
☆228Updated this week
microsoft / onnxruntime-genai
Generative AI extensions for onnxruntime
☆861Updated this week
huggingface / optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
☆502Updated this week
triton-inference-server / backend
Common source, scripts and utilities for creating Triton backends.
☆352Updated 2 weeks ago
triton-inference-server / client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
☆652Updated 2 weeks ago
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆79Updated 2 weeks ago
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆213Updated 6 months ago
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆460Updated 2 months ago
microsoft / onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
☆1,516Updated last week
triton-inference-server / pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
☆823Updated 2 months ago
triton-inference-server / python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
☆648Updated last week
pytorch / ort
Accelerate PyTorch models with ONNX Runtime
☆365Updated 8 months ago
tpoisonooo / llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
☆367Updated 2 years ago
onnx / onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
☆923Updated this week
justinchuby / onnx-safetensors
Use safetensors with ONNX 🤗
☆73Updated 3 weeks ago
triton-inference-server / dali_backend
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
☆139Updated 2 weeks ago
intel / onnxruntime
ONNX Runtime: cross-platform, high performance scoring engine for ML models
☆72Updated last week
Talmaj / onnx2pytorch
Transform ONNX model to PyTorch representation
☆340Updated 11 months ago
PINTO0309 / simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op…
☆298Updated last year
onnx / neural-compressor
Model compression for ONNX
☆97Updated 11 months ago
triton-inference-server / tutorials
This repository contains tutorials and examples for Triton Inference Server
☆787Updated 2 weeks ago
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆295Updated last year
ENOT-AutoDL / onnx2torch
Convert ONNX models to PyTorch.
☆705Updated last week