openvinotoolkit / model_server
A scalable inference server for models optimized with OpenVINO™
☆675 · Updated this week
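To sanity-check a running deployment, the server can be queried over its TensorFlow Serving-compatible REST API. The sketch below is a minimal illustration, assuming a server already listening on localhost:8000 with a model named my_model that takes a single float vector; the model name, port, and input shape are placeholder assumptions, not values from this repository.

```python
# Minimal sketch: query OpenVINO Model Server over its TensorFlow
# Serving-compatible REST API. "my_model", the port, and the input
# shape are placeholder assumptions -- adjust to your deployment.
import requests

payload = {"instances": [[0.1, 0.2, 0.3, 0.4]]}  # one batch entry
resp = requests.post(
    "http://localhost:8000/v1/models/my_model:predict",
    json=payload,
    timeout=10,
)
resp.raise_for_status()
print(resp.json()["predictions"])
```

The same server also exposes a gRPC endpoint; the REST call above is just the quickest smoke test.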
Related projects
Alternatives and complementary repositories for model_server
- TensorFlow/TensorRT integration ☆736 · Updated 11 months ago
- Inference Model Manager for Kubernetes ☆46 · Updated 5 years ago
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte… ☆683 · Updated this week
- Triton Model Analyzer is a CLI tool that helps you understand the compute and memory requirements of the Triton Inference Serv… ☆433 · Updated last week
- Run Generative AI models with a simple C++/Python API on top of OpenVINO Runtime (sketched after this list) ☆152 · Updated this week
- This repository is the home of Intel® Deep Learning Streamer (Intel® DL Streamer) Pipeline Framework. Pipeline Framework is a streaming med… ☆529 · Updated 3 weeks ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆943 · Updated this week
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system… ☆392 · Updated 6 months ago
- Actively maintained ONNX Optimizer ☆647 · Updated 8 months ago
- Triton Python, C++, and Java client libraries, and gRPC-generated client examples for Go, Java, and Scala. ☆570 · Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools (also sketched after this list) ☆409 · Updated this week
- Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™ ☆1,143 · Updated this week
- Explore the Capabilities of the TensorRT Platform ☆260 · Updated 3 years ago
- The framework to generate a Dockerfile, build, test, and deploy a Docker image with the OpenVINO™ toolkit. ☆59 · Updated last week
- ONNXMLTools enables conversion of models to ONNX ☆1,024 · Updated 5 months ago
- Common utilities for ONNX converters ☆251 · Updated 5 months ago
- A performant and modular runtime for TensorFlow ☆756 · Updated last month
- OpenVINO operator for OpenShift and Kubernetes ☆14 · Updated 2 months ago
- Common source, scripts and utilities for creating Triton backends. ☆295 · Updated this week
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's Python API. ☆125 · Updated 2 weeks ago
- TensorFlow backend for ONNX ☆1,284 · Updated 7 months ago
- Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python. ☆553 · Updated this week
- ONNX-TensorRT: TensorRT backend for ONNX ☆2,953 · Updated 2 weeks ago
- Samples for TensorRT/DeepStream for Tesla & Jetson ☆1,141 · Updated last month
- Sample apps to demonstrate how to deploy models trained with TAO on DeepStream ☆377 · Updated last month
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆185 · Updated 2 months ago
- Computation using data flow graphs for scalable machine learning ☆67 · Updated this week
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server ☆279 · Updated 2 years ago
- Repository for OpenVINO's extra modules ☆106 · Updated 2 weeks ago
- Save and load frozen graphs and run inference from them in TensorFlow 1.x and 2.x ☆300 · Updated 3 years ago
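As referenced above, the OpenVINO GenAI entry advertises a simple C++/Python API over OpenVINO Runtime. A minimal Python sketch, assuming a model directory already converted to OpenVINO IR format; the path below is a placeholder, not a value from the project:

```python
# Hedged sketch of the OpenVINO GenAI Python API (openvino_genai).
# "./llm_model_dir" is a placeholder for a directory containing a
# model already exported to OpenVINO IR.
import openvino_genai

# Load the pipeline on CPU; "GPU" or other device names also work
pipe = openvino_genai.LLMPipeline("./llm_model_dir", "CPU")
print(pipe.generate("What is OpenVINO?", max_new_tokens=64))
```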
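Likewise for the 🤗 Optimum Intel entry: a hedged sketch of running a Transformers model on OpenVINO Runtime via its OVModel classes. The model id below is an arbitrary public example, not one suggested by the project.

```python
# Hedged sketch: run a Hugging Face model on OpenVINO Runtime via
# Optimum Intel. The model id is an arbitrary public example.
from optimum.intel import OVModelForSequenceClassification
from transformers import AutoTokenizer

model_id = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_id)
# export=True converts the PyTorch checkpoint to OpenVINO IR on load
model = OVModelForSequenceClassification.from_pretrained(model_id, export=True)

inputs = tokenizer("OpenVINO makes inference fast.", return_tensors="pt")
print(model(**inputs).logits)
```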