ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
☆19,779Apr 7, 2026Updated this week
Alternatives and similar repositories for onnxruntime
Users that are interested in onnxruntime are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open standard for machine learning interoperability☆20,584Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,851Mar 25, 2026Updated last week
- Visualizer for neural network, deep learning and machine learning models☆32,696Updated this week
- Examples for using ONNX Runtime for machine learning inferencing.☆1,635Feb 24, 2026Updated last month
- Simplify your onnx model☆4,311Mar 24, 2026Updated 2 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Open Machine Learning Compiler Framework☆13,252Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆41,977Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,507Updated this week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆23,051Updated this week
- A collection of pre-trained, state-of-the-art models in the ONNX format☆9,521Mar 9, 2026Updated 3 weeks ago
- Development repository for the Triton language and compiler☆18,840Updated this week
- ONNX-TensorRT: TensorRT backend for ONNX☆3,201Mar 25, 2026Updated 2 weeks ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆98,800Updated this week
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆10,002Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A library for efficient similarity search and clustering of dense vectors.☆39,628Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆74,805Mar 31, 2026Updated last week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆158,637Apr 1, 2026Updated last week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆30,990Updated this week
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,526Updated this week
- LLM inference in C/C++☆101,475Updated this week
- Tensor library for machine learning☆14,340Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆35,311Updated this week
- Transformer related optimization, including BERT, GPT☆6,410Mar 27, 2024Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.☆14,753Updated this week
- Fast and memory-efficient exact attention☆23,185Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,915Updated this week
- Tutorials for creating and using ONNX models☆3,664Jul 15, 2024Updated last year
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,962Updated this week
- Cross-platform, customizable ML solutions for live and streaming media.☆34,446Mar 31, 2026Updated last week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,686Dec 1, 2025Updated 4 months ago
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆13,304Updated this week
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆34,284Mar 21, 2026Updated 2 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆42,231Apr 1, 2026Updated last week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆22,077Jan 23, 2026Updated 2 months ago
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…☆14,348Jul 3, 2024Updated last year
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search☆43,596Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,348Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆17,048Updated this week
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆2,286Updated this week