microsoft / onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime
☆371 Updated this week
Alternatives and similar repositories for onnxruntime-extensions:
Users who are interested in onnxruntime-extensions compare it to the libraries listed below.
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. ☆329 Updated this week
- ONNX Optimizer ☆689 Updated this week
- Common utilities for ONNX converters ☆261 Updated 4 months ago
- Examples for using ONNX Runtime for model training. ☆330 Updated 5 months ago
- The Triton backend for the ONNX Runtime. ☆140 Updated 3 weeks ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆466 Updated 3 weeks ago
- A parser, editor and profiler tool for ONNX models. ☆422 Updated 2 months ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆834 Updated last week
- ONNXMLTools enables conversion of models to ONNX ☆1,064 Updated 2 months ago
- Accelerate PyTorch models with ONNX Runtime ☆358 Updated last month
- A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change op… ☆291 Updated 11 months ago
- A Toolkit to Help Optimize ONNX Models ☆129 Updated this week
- Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala. ☆613 Updated 2 weeks ago
- A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, et… ☆832 Updated 2 weeks ago
- TensorRT Plugin Autogen Tool ☆369 Updated last year
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆199 Updated 2 months ago
- A Toolkit to Help Optimize Large ONNX Models ☆154 Updated 10 months ago
- Scailable ONNX Python tools ☆97 Updated 5 months ago
- Universal cross-platform tokenizers binding to HF and sentencepiece ☆316 Updated last month
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆456 Updated this week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments. ☆785 Updated last month
- Transform ONNX model to PyTorch representation ☆329 Updated 4 months ago
- Generative AI extensions for onnxruntime ☆667 Updated this week
- Common source, scripts and utilities for creating Triton backends. ☆311 Updated last week
- Examples for using ONNX Runtime for machine learning inferencing. ☆1,344 Updated 2 months ago
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask. ☆1,460 Updated last month
- The Triton backend for TensorRT. ☆70 Updated 3 weeks ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv… ☆772 Updated last week
- Supporting PyTorch models with the Google AI Edge TFLite runtime. ☆512 Updated this week
- The Triton backend for the PyTorch TorchScript models. ☆144 Updated 3 weeks ago