triton-inference-server / dali_backend
The Triton backend for running GPU-accelerated data pre-processing pipelines implemented with DALI's Python API.
☆125 · Updated 2 weeks ago
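To make the description concrete, here is a minimal sketch of how such a pipeline is defined and serialized for the backend using DALI's `pipeline_def` API; the input name `DALI_INPUT_0`, the batch size, and the image dimensions are illustrative assumptions, not requirements of the backend.

```python
# Minimal sketch: define and serialize a DALI pre-processing pipeline.
import nvidia.dali as dali
import nvidia.dali.fn as fn
import nvidia.dali.types as types

@dali.pipeline_def(batch_size=256, num_threads=4, device_id=0)
def preprocessing():
    # The external source name must match the input name in config.pbtxt.
    images = fn.external_source(device="cpu", name="DALI_INPUT_0")
    # "mixed" parses on the CPU and decodes on the GPU.
    images = fn.decoders.image(images, device="mixed", output_type=types.RGB)
    return fn.resize(images, resize_x=224, resize_y=224)

if __name__ == "__main__":
    # Serialize the pipeline into the file Triton loads as the model.
    preprocessing().serialize(filename="model.dali")
```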
Related projects
Alternatives and complementary repositories for dali_backend
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying deep learning models with a focus on NVIDIA GPUs. ☆185 · Updated 2 months ago
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server. ☆279 · Updated 2 years ago
- Common source, scripts and utilities for creating Triton backends. ☆295 · Updated this week
- Triton Model Analyzer is a CLI tool that helps you understand the compute and memory requirements of Triton Inference Server models. ☆433 · Updated last week
- The Triton backend for the ONNX Runtime. ☆132 · Updated this week
- The Triton backend for TensorRT. ☆64 · Updated last week
- A PyTorch-to-TensorRT converter with dynamic shape support. ☆257 · Updated 9 months ago
- A Triton server ensemble model demo. ☆30 · Updated 2 years ago
- Common utilities for ONNX converters. ☆251 · Updated 5 months ago
- ⚡ Useful scripts for working with TensorRT. ☆240 · Updated 4 years ago
- Sample app code for deploying TAO Toolkit trained models to Triton. ☆84 · Updated 2 months ago
- Triton Python, C++, and Java client libraries, and gRPC-generated client examples for Go, Java, and Scala; a minimal Python client sketch follows this list. ☆570 · Updated this week
- Count the number of parameters / MACs / FLOPs for ONNX models. ☆89 · Updated 3 weeks ago
- Useful TensorRT plugins for PyTorch and MMDetection model conversion. ☆159 · Updated last month
- Triton backend that enables pre-processing, post-processing, and other logic to be implemented in Python; a minimal model sketch also follows this list. ☆553 · Updated this week
- Decode JPEG images on the GPU using PyTorch. ☆84 · Updated last year
- The Triton backend for PyTorch TorchScript models. ☆127 · Updated this week
- Inference of quantization-aware trained networks using TensorRT. ☆79 · Updated last year
- How to deploy open source models using DeepStream and Triton Inference Server. ☆74 · Updated 4 months ago
- A project demonstrating how to use nvmetamux to run multiple models in parallel. ☆95 · Updated last month
- TensorRT Plugin Autogen Tool. ☆367 · Updated last year
- Actively maintained ONNX Optimizer. ☆647 · Updated 8 months ago
- Sample apps to demonstrate how to deploy models trained with TAO on DeepStream. ☆377 · Updated last month
- Common source, scripts and utilities shared across all Triton repositories. ☆62 · Updated this week
- A simple tool that can generate TensorRT plugin code quickly. ☆221 · Updated last year
- Implementations of popular deep learning networks in PyTorch, used by tensorrtx. ☆191 · Updated 2 years ago
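Two of the entries above deserve a concrete illustration. First, the Triton client libraries: the sketch below sends a batch of encoded JPEGs to a model over HTTP with the Python client. The model name `dali` and the tensor names `DALI_INPUT_0`/`DALI_OUTPUT_0` are assumptions matching the pipeline sketch near the top of this page, not names fixed by Triton.

```python
# Minimal sketch: query a Triton model with the Python HTTP client.
# Model and tensor names below are assumptions, not fixed by Triton.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

# A batch of one encoded JPEG, sent as a 2-D UINT8 tensor.
raw = np.fromfile("image.jpg", dtype=np.uint8)
batch = np.stack([raw])  # shape: (1, num_bytes)

inp = httpclient.InferInput("DALI_INPUT_0", list(batch.shape), "UINT8")
inp.set_data_from_numpy(batch)

result = client.infer(model_name="dali", inputs=[inp])
print(result.as_numpy("DALI_OUTPUT_0").shape)
```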
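Second, the Python backend: a model there is a `model.py` file exposing a `TritonPythonModel` class whose `execute` method maps requests to responses. The sketch below shows the shape of such a file; the tensor names and the normalization step are illustrative assumptions.

```python
# Minimal sketch of a Python-backend model.py. The module
# triton_python_backend_utils is provided by Triton at runtime.
import numpy as np
import triton_python_backend_utils as pb_utils

class TritonPythonModel:
    def execute(self, requests):
        responses = []
        for request in requests:
            # Tensor names must match config.pbtxt; these are assumptions.
            in0 = pb_utils.get_input_tensor_by_name(request, "INPUT0")
            data = in0.as_numpy().astype(np.float32)
            # Example post-processing: scale pixel values to [0, 1].
            out = pb_utils.Tensor("OUTPUT0", data / 255.0)
            responses.append(pb_utils.InferenceResponse(output_tensors=[out]))
        return responses
```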