This repository serves as an example of deploying the YOLO models on Triton Server for performance and testing purposes
☆69Oct 20, 2025Updated 6 months ago
Alternatives and similar repositories for triton-server-yolo
Users that are interested in triton-server-yolo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository utilizes the Triton Inference Server Client, which streamlines the complexity of model deployment.☆21Sep 1, 2024Updated last year
- Implementation of Nvidia DeepStream 7 with YOLOv9 Models.☆15Jun 22, 2024Updated last year
- The Purpose of this repository is to create a DeepStream/Triton-Server sample application that utilizes yolov7, yolov7-qat, yolov9 models…☆19Apr 1, 2024Updated 2 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆140Apr 24, 2025Updated last year
- Provides an ensemble model to deploy a YoloV8 ONNX model to Triton☆42Oct 19, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of End-to-End YOLO Models for DeepStream☆74Feb 26, 2026Updated 2 months ago
- C++ application to perform computer vision tasks using Nvidia Triton Server for model inference☆30Apr 28, 2026Updated last week
- This repository implements the YOLOv9 model on Jetson Orin Nano☆19Aug 28, 2024Updated last year
- ☆18Mar 28, 2024Updated 2 years ago
- YOLOV7 Face Detection☆22Dec 15, 2022Updated 3 years ago
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- Accelerating SAHI-based inference on YOLO models using TensorRT.☆98Jan 6, 2026Updated 4 months ago
- This is a repo with a Triton Server deployment template☆24Aug 4, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- NVIDIA DeepStream SDK 8.0 / 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 application for YOLO-Face models☆80Oct 13, 2025Updated 6 months ago
- A project demonstrating how to make DeepStream docker images.☆92Apr 20, 2026Updated 2 weeks ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- RT-DETRv2 tensorrt C++ 部署☆26Oct 29, 2024Updated last year
- ☆17Oct 16, 2023Updated 2 years ago
- ☆25Oct 10, 2022Updated 3 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- FastSAM 部署rknn C++ 代码☆13May 30, 2024Updated last year
- This is a repository to practice multi-thread programming in C++☆29Feb 21, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FastSAM 部署版本,便于移植不同平,部署简单、运行速度快。☆25May 30, 2024Updated last year
- DeepStream Libraries offer CVCUDA, NvImageCodec, and PyNvVideoCodec modules as Python APIs for seamless integration into custom framewor…☆81Mar 18, 2026Updated last month
- Cpp and python implementation of YOLOv9 using TensorRT API☆123Sep 30, 2024Updated last year
- ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation Transformer☆35May 12, 2025Updated 11 months ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- ☆15Jan 10, 2023Updated 3 years ago
- A shared library of on-demand DeepStream Pipeline Services for Python and C/C++☆341Mar 17, 2025Updated last year
- ☆25Oct 6, 2022Updated 3 years ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- NVIDIA DeepStream SDK 8.0 / 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 / 5.1 implementation for YOLO models☆2,006Jan 25, 2026Updated 3 months ago
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- [CVPR 2023] OC-SORT implemented in C++ with Eigen Library, Plus a Android Demo Apk☆73Dec 24, 2025Updated 4 months ago
- A project showcasing how to leverage AI coding assistants (Cursor, Claude Code, etc.) for accelerated NVIDIA DeepStream SDK application d…☆51Mar 30, 2026Updated last month
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆140Apr 8, 2026Updated last month
- YOLOv5 on Orin DLA☆223Feb 18, 2024Updated 2 years ago