triton-inference-server / openvino_backend
OpenVINO backend for Triton.
☆31Updated 3 weeks ago
Alternatives and similar repositories for openvino_backend:
Users that are interested in openvino_backend are comparing it to the libraries listed below
- The Triton backend for the ONNX Runtime.☆140Updated 3 weeks ago
- The Triton backend for TensorRT.☆70Updated 3 weeks ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆199Updated 2 months ago
- The Triton backend for the PyTorch TorchScript models.☆144Updated 3 weeks ago
- Common source, scripts and utilities for creating Triton backends.☆311Updated last week
- ☆53Updated this week
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…☆61Updated 2 weeks ago
- The Triton backend for TensorFlow.☆51Updated 3 weeks ago
- Model compression for ONNX☆88Updated 4 months ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆132Updated last week
- Common source, scripts and utilities shared across all Triton repositories.☆69Updated last week
- ☆18Updated 3 weeks ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆466Updated 3 weeks ago
- ☆31Updated 2 years ago
- The core library and APIs implementing the Triton Inference Server.☆123Updated last week
- A Toolkit to Help Optimize Onnx Model☆129Updated this week
- ☆33Updated last year
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆62Updated 3 weeks ago
- A Toolkit to Help Optimize Large Onnx Model☆154Updated 10 months ago
- Common utilities for ONNX converters☆261Updated 4 months ago
- ☆69Updated 2 years ago
- FIL backend for the Triton Inference Server☆77Updated last week
- MLPerf™ logging library☆33Updated this week
- ☆58Updated 4 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆32Updated 2 weeks ago
- ☆49Updated 3 weeks ago
- ☆30Updated last week
- ☆124Updated last year
- llm deploy project based onnx.☆35Updated 5 months ago
- oneCCL Bindings for Pytorch*☆91Updated this week