openvinotoolkit / model_server
A scalable inference server for models optimized with OpenVINO™
☆694Updated this week
Alternatives and similar repositories for model_server:
Users that are interested in model_server are comparing it to the libraries listed below
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆689Updated this week
- Inference Model Manager for Kubernetes☆46Updated 5 years ago
- TensorFlow/TensorRT integration☆738Updated last year
- This repository is a home to Intel® Deep Learning Streamer (Intel® DL Streamer) Pipeline Framework. Pipeline Framework is a streaming med…☆538Updated 3 weeks ago
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system…☆392Updated 8 months ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆446Updated this week
- Home of Intel(R) Deep Learning Streamer Pipeline Server (formerly Video Analytics Serving)☆126Updated last year
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆198Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆585Updated this week
- Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™☆1,153Updated this week
- Samples for TensorRT/Deepstream for Tesla & Jetson☆1,167Updated last month
- The framework to generate a Dockerfile, build, test, and deploy a docker image with OpenVINO™ toolkit.☆62Updated 3 weeks ago
- Tensorflow Backend for ONNX☆1,292Updated 9 months ago
- OpenVINO™ integration with TensorFlow☆179Updated 6 months ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆193Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆430Updated this week
- Neo-AI-DLR is a common runtime for machine learning models compiled by AWS SageMaker Neo, TVM, or TreeLite.☆492Updated last year
- Convert tf.keras/Keras models to ONNX☆380Updated 3 years ago
- Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.☆576Updated this week
- Save, Load Frozen Graph and Run Inference From Frozen Graph in TensorFlow 1.x and 2.x☆300Updated 4 years ago
- ONNXMLTools enables conversion of models to ONNX☆1,044Updated last week
- Multi Model Server is a tool for serving neural net models for inference☆1,002Updated 7 months ago
- Dockerfiles and scripts for ONNX container images☆136Updated 2 years ago
- Describes the full end to end smart parking application that is available with DeepStream 5.0☆342Updated 6 months ago
- TensorFlow models accelerated with NVIDIA TensorRT☆686Updated 3 years ago
- Common source, scripts and utilities for creating Triton backends.☆305Updated this week
- DeepStream SDK Python bindings and sample applications☆1,539Updated 3 months ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆131Updated this week
- Explore the Capabilities of the TensorRT Platform☆261Updated 3 years ago
- Sample apps to demonstrate how to deploy models trained with TAO on DeepStream☆387Updated 2 months ago