openvinotoolkit / model_server
A scalable inference server for models optimized with OpenVINO™
☆723Updated this week
Alternatives and similar repositories for model_server:
Users that are interested in model_server are comparing it to the libraries listed below
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆711Updated this week
- TensorFlow/TensorRT integration☆742Updated last year
- Inference Model Manager for Kubernetes☆46Updated 6 years ago
- This repository is a home to Intel® Deep Learning Streamer (Intel® DL Streamer) Pipeline Framework. Pipeline Framework is a streaming med…☆550Updated 2 weeks ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,004Updated this week
- A multi-user, distributed computing environment for running DL model training experiments on Intel® Xeon® Scalable processor-based system…☆392Updated 11 months ago
- ONNXMLTools enables conversion of models to ONNX☆1,074Updated 3 months ago
- Home of Intel(R) Deep Learning Streamer Pipeline Server (formerly Video Analytics Serving)☆126Updated last year
- The framework to generate a Dockerfile, build, test, and deploy a docker image with OpenVINO™ toolkit.☆65Updated 3 weeks ago
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆268Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆460Updated this week
- Repository for OpenVINO's extra modules☆119Updated 3 weeks ago
- Convert tf.keras/Keras models to ONNX☆378Updated 3 years ago
- A performant and modular runtime for TensorFlow☆761Updated 2 weeks ago
- Common source, scripts and utilities for creating Triton backends.☆318Updated last week
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆474Updated 2 weeks ago
- Explore the Capabilities of the TensorRT Platform☆264Updated 3 years ago
- Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™☆1,163Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆620Updated last week
- Software Development Kit (SDK) for the Intel® Geti™ platform for Computer Vision AI model training.☆87Updated this week
- ONNX Optimizer☆700Updated this week
- Common utilities for ONNX converters☆268Updated 5 months ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆199Updated 2 weeks ago
- Tools for easier OpenVINO development/debugging☆10Updated last month
- OpenVINO™ integration with TensorFlow☆179Updated 10 months ago
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,409Updated 3 months ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆377Updated this week
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆132Updated last week
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server☆283Updated 2 years ago
- Save, Load Frozen Graph and Run Inference From Frozen Graph in TensorFlow 1.x and 2.x☆302Updated 4 years ago