The Triton backend for TensorRT.
☆88Jun 11, 2026Updated this week
Alternatives and similar repositories for tensorrt_backend
Users that are interested in tensorrt_backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Common source, scripts and utilities for creating Triton backends.☆372May 20, 2026Updated 3 weeks ago
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.☆693Updated this week
- The Triton backend for the ONNX Runtime.☆177May 23, 2026Updated 3 weeks ago
- Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.☆676Jun 2, 2026Updated last week
- ☆345Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Triton TensorRT-LLM Backend☆935Updated this week
- The core library and APIs implementing the Triton Inference Server.☆172Jun 6, 2026Updated last week
- ☆22Updated this week
- The Triton backend for the PyTorch TorchScript models.☆178Updated this week
- ☆27Nov 6, 2024Updated last year
- Common source, scripts and utilities shared across all Triton repositories.☆79May 8, 2026Updated last month
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,750Updated this week
- This repository contains tutorials and examples for Triton Inference Server☆840Updated this week
- custom payload for send nvdsanalytics message to kafka☆23Nov 16, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Unofficial golang package for the Triton Inference Server(https://github.com/triton-inference-server/server)☆51Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆222May 27, 2026Updated 2 weeks ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- A project demonstrating how to make DeepStream docker images.☆92Apr 20, 2026Updated last month
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆23Oct 14, 2025Updated 8 months ago
- The vLLM XPU kernels for Intel GPU☆47Updated this week
- Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code.☆10Jan 16, 2022Updated 4 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆13Apr 28, 2024Updated 2 years ago
- ☆25Oct 10, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆513Updated this week
- A simple tool that can generate TensorRT plugin code quickly.☆241Jul 11, 2023Updated 2 years ago
- yolov5-deepsort+opencv.kcf+TensorRT+QT☆29Jan 20, 2022Updated 4 years ago
- ☆30Apr 29, 2026Updated last month
- Access all your locally hosted AI Tools from anywhere, by 'Slinging' your browser to them via a 'Remote Intelligent Neural Gateway.'☆16Dec 26, 2023Updated 2 years ago
- Deploy stable diffusion model with onnx/tenorrt + tritonserver☆125Aug 15, 2023Updated 2 years ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆13,061Jun 3, 2026Updated last week
- PaddleOCR Lite license plate detection on bare Raspberry Pi 4☆12Apr 16, 2024Updated 2 years ago
- Custom gst-nvinfer for alignment in Deepstream☆31Nov 22, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)☆12Jun 13, 2023Updated 3 years ago
- Nvidia HairWorks OpenGL implementation☆12Apr 30, 2016Updated 10 years ago
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆844Aug 13, 2025Updated 10 months ago
- This repository contains the results and code for the MLPerf™ Inference v2.1 benchmark.☆18Jul 24, 2025Updated 10 months ago
- Official implementation of the ICLR 2024 paper AffineQuant☆30Mar 30, 2024Updated 2 years ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated last year
- A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative…☆2,891Updated this week