openvinotoolkit / model_serverLinks
A scalable inference server for models optimized with OpenVINO™
☆804Updated this week
Alternatives and similar repositories for model_server
Users that are interested in model_server are comparing it to the libraries listed below
Sorting:
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆723Updated this week
- TensorFlow/TensorRT integration☆744Updated 2 years ago
- Deep Learning Streamer (DL Streamer) Pipeline Framework is an open-source streaming media analytics framework, based on GStreamer* multim…☆567Updated this week
- The framework to generate a Dockerfile, build, test, and deploy a docker image with OpenVINO™ toolkit.☆68Updated 3 weeks ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆501Updated last week
- Repository for OpenVINO's extra modules☆153Updated 2 weeks ago
- Dockerfiles and scripts for ONNX container images☆138Updated 3 years ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆518Updated this week
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆398Updated this week
- Common source, scripts and utilities for creating Triton backends.☆362Updated 2 weeks ago
- Common utilities for ONNX converters☆289Updated last week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆667Updated last week
- Convert tf.keras/Keras models to ONNX☆381Updated 4 years ago
- Neo-AI-DLR is a common runtime for machine learning models compiled by AWS SageMaker Neo, TVM, or TreeLite.☆496Updated 2 years ago
- ONNXMLTools enables conversion of models to ONNX☆1,130Updated 3 weeks ago
- OpenVINO™ integration with TensorFlow☆178Updated last year
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆216Updated 8 months ago
- Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™☆1,208Updated this week
- Inference Model Manager for Kubernetes☆46Updated 6 years ago
- Sample apps to demonstrate how to deploy models trained with TAO on DeepStream☆437Updated last month
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,111Updated this week
- Tensorflow Backend for ONNX☆1,327Updated last year
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆431Updated this week
- Explore the Capabilities of the TensorRT Platform☆264Updated 4 years ago
- Reference implementations of MLPerf® inference benchmarks☆1,506Updated this week
- ONNX Optimizer☆780Updated last month
- An example of using DeepStream SDK for redaction☆211Updated last year
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆139Updated last week
- Describes the full end to end smart parking application that is available with DeepStream 5.0☆348Updated last year
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system…☆392Updated last year