The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
☆141Feb 26, 2026Updated last week
Alternatives and similar repositories for dali_backend
Users that are interested in dali_backend are comparing it to the libraries listed below
Sorting:
- Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.☆673Feb 27, 2026Updated last week
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,406Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆684Feb 24, 2026Updated last week
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server☆284Jun 2, 2022Updated 3 years ago
- Common source, scripts and utilities for creating Triton backends.☆369Feb 9, 2026Updated 3 weeks ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,637Feb 27, 2026Updated last week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆218Feb 3, 2026Updated last month
- Pilgrim Project: torch2trt, quick convert your pytorch model to TensorRT engine.☆19Oct 10, 2020Updated 5 years ago
- The Triton backend for the PyTorch TorchScript models.☆173Updated this week
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- ☆332Feb 9, 2026Updated 3 weeks ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,948Updated this week
- 训练速度比原始caffe-ssd提升4~6倍☆10Jun 22, 2021Updated 4 years ago
- ☆18Nov 11, 2025Updated 3 months ago
- TF 2 implementation Learning to Resize Images for Computer Vision Tasks (https://arxiv.org/abs/2103.09950v1).☆53Oct 12, 2021Updated 4 years ago
- Triton Migration Guide for DeepStreamSDK.☆15Dec 19, 2023Updated 2 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- The Triton backend for the ONNX Runtime.☆173Feb 25, 2026Updated last week
- ☆14Jun 12, 2015Updated 10 years ago
- How to quickly serve an LLM using Fast API, Celery, and Redis☆16Aug 29, 2023Updated 2 years ago
- Demonstration of the use of TensorRT and TRITON☆16Feb 9, 2021Updated 5 years ago
- ONNX-TensorRT: TensorRT backend for ONNX☆3,188Feb 3, 2026Updated last month
- ☆135Updated this week
- This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes☆69Oct 20, 2025Updated 4 months ago
- common in-memory tensor structure☆1,171Jan 26, 2026Updated last month
- DeepStream SDK Python bindings and sample applications☆1,795Oct 14, 2025Updated 4 months ago
- Tencent Distribution of TVM