openvinotoolkit / model_serverLinks
A scalable inference server for models optimized with OpenVINO™
☆751Updated this week
Alternatives and similar repositories for model_server
Users that are interested in model_server are comparing it to the libraries listed below
Sorting:
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte…☆716Updated last week
- DL Streamer is now part of Open Edge Platform, for latest updates and releases please visit new repo: https://github.com/open-edge-platfo…☆564Updated last month
- TensorFlow/TensorRT integration☆743Updated last year
- OpenVINO™ integration with TensorFlow☆179Updated last year
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆484Updated 2 weeks ago
- Repository for OpenVINO's extra modules☆134Updated last week
- The framework to generate a Dockerfile, build, test, and deploy a docker image with OpenVINO™ toolkit.☆67Updated last month
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆324Updated this week
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆482Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆641Updated last week
- Common source, scripts and utilities for creating Triton backends.☆338Updated 2 weeks ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,074Updated this week
- ONNXMLTools enables conversion of models to ONNX☆1,105Updated 2 months ago
- Reference implementations of MLPerf™ inference benchmarks☆1,441Updated last week
- A performant and modular runtime for TensorFlow☆758Updated 2 weeks ago
- Common utilities for ONNX converters☆276Updated last month
- Adlik: Toolkit for Accelerating Deep Learning Inference☆805Updated last year
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆136Updated last week
- Inference Model Manager for Kubernetes☆46Updated 6 years ago
- Explore the Capabilities of the TensorRT Platform☆264Updated 3 years ago
- Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™☆1,199Updated this week
- Dockerfiles and scripts for ONNX container images☆137Updated 3 years ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆211Updated 3 months ago
- Sample apps to demonstrate how to deploy models trained with TAO on DeepStream☆424Updated 5 months ago
- Learn about the workflow using Intel® Distribution of OpenVINO™ toolkit to accelerate vision, automatic speech recognition, natural langu…☆301Updated last year
- Tensorflow Backend for ONNX☆1,315Updated last year
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆407Updated last week
- ONNX Runtime: cross-platform, high performance scoring engine for ML models☆70Updated last week
- Describes the full end to end smart parking application that is available with DeepStream 5.0☆346Updated last year
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server☆287Updated 3 years ago