openvinotoolkit / model_server
A scalable inference server for models optimized with OpenVINO™
☆797 · Updated this week
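As a quick illustration of what model_server does, here is a minimal sketch of sending an inference request to a running OpenVINO Model Server over its KServe-compatible REST API. The server address, model name (`resnet`), input tensor name, and shape below are placeholders; adjust them to match your deployment.

```python
# Minimal sketch: query OpenVINO Model Server via its KServe v2 REST endpoint.
# Assumes a model named "resnet" is already served on localhost:8000 with a
# single FP32 input of shape [1, 3, 224, 224] (all placeholders).
import requests

payload = {
    "inputs": [
        {
            "name": "0",                            # input tensor name from the model metadata
            "shape": [1, 3, 224, 224],
            "datatype": "FP32",
            "data": [0.0] * (1 * 3 * 224 * 224),    # dummy image data
        }
    ]
}

resp = requests.post(
    "http://localhost:8000/v2/models/resnet/infer",  # KServe v2 inference endpoint
    json=payload,
    timeout=30,
)
resp.raise_for_status()
print(resp.json()["outputs"][0]["shape"])
```

Model Server also exposes a TensorFlow Serving-compatible REST/gRPC API alongside the KServe one, so existing TFS clients can typically be pointed at it unchanged.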
Alternatives and similar repositories for model_server
Users interested in model_server are comparing it to the libraries listed below.
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte… ☆720 · Updated 2 weeks ago
- TensorFlow/TensorRT integration ☆744 · Updated 2 years ago
- Triton Model Analyzer is a CLI tool to help understand the compute and memory requirements of the Triton Inference Serv… ☆499 · Updated this week
- Dockerfiles and scripts for ONNX container images ☆138 · Updated 3 years ago
- Common source, scripts, and utilities for creating Triton backends. ☆360 · Updated 2 weeks ago
- DL Streamer is now part of Open Edge Platform; for the latest updates and releases, please visit the new repo: https://github.com/open-edge-platfo… ☆565 · Updated 4 months ago
- Triton Python, C++, and Java client libraries, and gRPC-generated client examples for Go, Java, and Scala. ☆663 · Updated 2 weeks ago
- Convert tf.keras/Keras models to ONNX ☆381 · Updated 4 years ago
- ONNXMLTools enables conversion of models to ONNX ☆1,128 · Updated 5 months ago
- Run Generative AI models with a simple C++/Python API using the OpenVINO Runtime ☆374 · Updated this week
- OpenVINO™ integration with TensorFlow ☆178 · Updated last year
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆1,109 · Updated this week
- Repository for OpenVINO's extra modules ☆150 · Updated last week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models, with a focus on NVIDIA GPUs. ☆213 · Updated 7 months ago
- Common utilities for ONNX converters ☆285 · Updated 2 months ago
- TensorFlow backend for ONNX ☆1,326 · Updated last year
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's Python API. ☆139 · Updated 2 weeks ago
- The framework to generate a Dockerfile, then build, test, and deploy a Docker image with the OpenVINO™ toolkit. ☆67 · Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools ☆513 · Updated this week
- Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python. ☆655 · Updated this week
- ONNX Optimizer ☆780 · Updated 3 weeks ago
- Adlik: Toolkit for Accelerating Deep Learning Inference ☆810 · Updated last year
- Save and load a frozen graph, and run inference from a frozen graph, in TensorFlow 1.x and 2.x ☆304 · Updated 4 years ago
- Examples for using ONNX Runtime for model training. ☆357 · Updated last year
- Neo-AI-DLR is a common runtime for machine learning models compiled by AWS SageMaker Neo, TVM, or TreeLite. ☆497 · Updated 2 years ago
- onnxruntime-extensions: a specialized pre- and post-processing library for ONNX Runtime ☆429 · Updated this week
- Convert TensorFlow, Keras, TensorFlow.js, and TFLite models to ONNX (see the conversion sketch after this list) ☆2,499 · Updated 2 months ago
- The core library and APIs implementing the Triton Inference Server. ☆156 · Updated last week
- Reference implementations of MLPerf® inference benchmarks ☆1,495 · Updated this week
- Explore the Capabilities of the TensorRT Platform ☆264 · Updated 4 years ago
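Several of the listed projects are ONNX converters. As an illustration of the tf2onnx entry above ("Convert TensorFlow, Keras, TensorFlow.js, and TFLite models to ONNX"), here is a minimal sketch of converting a Keras model; the toy model and output path are placeholders, and it assumes `tensorflow` and `tf2onnx` are installed.

```python
# Minimal sketch: convert a small tf.keras model to ONNX with tf2onnx.
# The toy model architecture and the "model.onnx" output path are placeholders.
import tensorflow as tf
import tf2onnx

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(224, 224, 3)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# from_keras returns the ONNX model proto plus external tensor storage
onnx_model, _ = tf2onnx.convert.from_keras(model, output_path="model.onnx")
print("Saved with ONNX opset:", onnx_model.opset_import[0].version)
```

The same conversion is also available from the command line, e.g. `python -m tf2onnx.convert --saved-model <dir> --output model.onnx` for a SavedModel directory.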