microsoft / onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
☆15,648Updated this week
Alternatives and similar repositories for onnxruntime:
Users that are interested in onnxruntime are comparing it to the libraries listed below
- Open standard for machine learning interoperability☆18,448Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆11,184Updated 2 weeks ago
- Tutorials for creating and using ONNX models☆3,451Updated 7 months ago
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆7,824Updated this week
- Visualizer for neural network, deep learning and machine learning models☆29,399Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆8,752Updated this week
- Simplify your onnx model☆3,976Updated 5 months ago
- A library for efficient similarity search and clustering of dense vectors.☆33,077Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,028Updated this week
- A collection of pre-trained, state-of-the-art models in the ONNX format☆8,267Updated 9 months ago
- Serve, optimize and scale PyTorch models in production☆4,294Updated this week
- Development repository for the Triton language and compiler☆14,452Updated this week
- Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.☆29,003Updated this week
- Examples for using ONNX Runtime for machine learning inferencing.☆1,296Updated 3 weeks ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆8,343Updated this week
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,368Updated 2 weeks ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆13,150Updated this week
- Transformer related optimization, including BERT, GPT☆6,025Updated 10 months ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,677Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,537Updated 2 weeks ago
- Ongoing research training transformer models at scale☆11,414Updated this week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆20,953Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆2,963Updated this week
- MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM …☆9,666Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆86,974Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,598Updated last week
- Label Studio is a multi-type data labeling and annotation tool with standardized output format☆20,797Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆31,324Updated this week
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…☆14,126Updated 7 months ago
- Fast and memory-efficient exact attention☆15,541Updated this week