YH-Wu/Triton-Inference-Server-on-Kubernetes

YH-Wu / Triton-Inference-Server-on-Kubernetes

☆34

Alternatives and similar repositories for Triton-Inference-Server-on-Kubernetes

Users who are interested in Triton-Inference-Server-on-Kubernetes are comparing it to the repositories listed below.

NVIDIA / healthcare-on-tap-TRT-TRITON-demo
Demonstration of the use of TensorRT and TRITON
☆16 · Feb 9, 2021 · Updated 5 years ago
GluuFederation / community-edition-containers
Manifest files for CE container packages
☆13 · Oct 14, 2024 · Updated last year
triton-inference-server / stateful_backend
Triton backend for automatically managing model state tensors in the sequence batcher
☆17 · Feb 12, 2024 · Updated 2 years ago
Bobo-y / triton_ensemble_model_demo
Triton server ensemble model demo
☆30 · May 2, 2022 · Updated 4 years ago
tonhathuy / tensorrt-triton-magface
MagFace on Triton Inference Server using TensorRT
☆19 · Feb 12, 2022 · Updated 4 years ago
ailia-ai / onnx-quantization
Example of ONNX quantization
☆11 · Feb 8, 2023 · Updated 3 years ago
NVIDIA-AI-IOT / deepstream_triton_model_deploy
How to deploy open source models using DeepStream and Triton Inference Server
☆87 · Jun 27, 2024 · Updated 2 years ago
isarsoft / yolov4-triton-tensorrt
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
☆283 · Jun 2, 2022 · Updated 4 years ago
cartovarc / opencv-to-webrtc
Streaming example using Python, OpenCV, NodeJS and WebRTC.
☆12 · Apr 7, 2020 · Updated 6 years ago
Luca-Dalmasso / matrixTransposeCUDA
Simple CUDA C application for NVIDIA GPUs
☆11 · Jun 7, 2022 · Updated 4 years ago
annaformaniuk / smoke-detection
☆11 · Apr 13, 2019 · Updated 7 years ago
Curt-Park / mnist-fastapi-celery-triton
Simple example of FastAPI + Celery + Triton for benchmarking
☆65 · Aug 11, 2022 · Updated 3 years ago
kankadev / frigate-gdrive-instant-uploader
Uploads Frigate clips to Google Drive using MQTT & Frigate API
☆12 · Jun 16, 2026 · Updated last month
rivia7 / faster-bert-as-service
Using TensorRT and Triton Server to build BERT model as a service
☆13 · Jan 10, 2022 · Updated 4 years ago
huynhbaobk / tensorrt-triton-yolov5
☆53 · Jan 24, 2022 · Updated 4 years ago
tkhe / detective
ncnn deployment supporting common algorithms such as RTMDet, YOLOv8, YOLOX, and Faster R-CNN
☆13 · Mar 17, 2024 · Updated 2 years ago
Pseudo-Lab / deep-learning-glossary
☆11 · Oct 17, 2023 · Updated 2 years ago
TencentYoutuResearch / MOT-CTracker
Code for ECCV 2020 paper (Spotlight) "Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object …
☆12 · Apr 22, 2021 · Updated 5 years ago
yas-sim / openvino-ep-enabled-onnxruntime
Describes how to enable the OpenVINO Execution Provider for ONNX Runtime
☆20 · Jun 29, 2020 · Updated 6 years ago
avik-das / seam-carver
Implementation of the content-aware image resizing algorithm presented in the paper "Seam carving for content-aware image resizing"
☆13 · Jul 22, 2019 · Updated 7 years ago
peterkim97 / COVID-911
☆11 · Mar 1, 2021 · Updated 5 years ago
system76 / bottle
Protobuf messages in a bottle
☆10 · Feb 14, 2025 · Updated last year
amazon-archives / fully-automated-neo4j-to-neptune
This AWS CDK app helps you migrate the simple Neo4j movies graph database to Amazon Neptune in a hands-free, fully automated way.
☆14 · May 3, 2020 · Updated 6 years ago
1095788063 / deepstream-python-rtsp-video-h264-gstreamer
Playing RTSP H.264 video with DeepStream Python
☆14 · Jan 10, 2022 · Updated 4 years ago
pedro-gutierrez / momo
Modular Monoliths in Elixir
☆12 · Mar 17, 2026 · Updated 4 months ago
cmcmurrough / teaching
Repository containing files related to CSE curriculum
☆16 · Nov 11, 2018 · Updated 7 years ago
dosemeion / yolov5-p6-tensorrt
☆13 · May 11, 2021 · Updated 5 years ago
Discord-TTS / tts-service
An HTTP microservice to generate TTS
☆15 · May 5, 2026 · Updated 2 months ago
aws-samples / aws-parallelcluster-megatron
☆15 · Mar 15, 2021 · Updated 5 years ago
memryx / MxAccl
MxAccl: open-source code for both the MemryX C++ runtime library and the acclBench benchmarking tool. These components enable seamless in…
☆19 · Apr 2, 2026 · Updated 3 months ago
Nebula4869 / YOLOv5-ncnn-vulkan
An ncnn-vulkan implementation of YOLOv5, capable of using GPU to accelerate inference
☆15 · Nov 18, 2021 · Updated 4 years ago
microsoft / Inventory-Management-for-IoT-Connected-Coolers-Solution-Accelerator
Accelerates creation of an inventory management solution for product manufacturers and suppliers
☆17 · Jul 9, 2023 · Updated 3 years ago
Percent-BFD / neurips_submission
☆17 · Nov 23, 2023 · Updated 2 years ago
IBMSpectrumComputing / lsf-hybrid-cloud
This repository contains sample code to help you extend your LSF cluster to the cloud. It provides fully functional examples of how to s…
☆16 · Dec 17, 2025 · Updated 7 months ago
cvlab-stonybrook / HandLer
Forward Propagation, Backward Regression and Pose Association for Hand Tracking in the Wild (CVPR 2022)
☆22 · Apr 27, 2022 · Updated 4 years ago
Egorundel / int8_calibrator_cpp
INT8 calibrator for ONNX model with dynamic batch_size at the input and NMS module at the output. C++ Implementation.
☆19 · Oct 15, 2024 · Updated last year
exponentially / helios
Building blocks for Elixir CQRS-segregated applications
☆15 · Sep 25, 2019 · Updated 6 years ago
MiguelAngelCalveraUnizar / Mini-SLAM_student
Student version of Mini-SLAM.
☆10 · Mar 16, 2024 · Updated 2 years ago
vc-nju / drfi_python
DRFI For Region Dissection
☆13 · Jan 11, 2019 · Updated 7 years ago