☆33Jul 7, 2022Updated 3 years ago
Alternatives and similar repositories for Triton-Inference-Server-on-Kubernetes
Users that are interested in Triton-Inference-Server-on-Kubernetes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Triton backend for managing the model state tensors automatically in sequence batcher☆17Feb 12, 2024Updated 2 years ago
- ☆19Jan 19, 2024Updated 2 years ago
- How to deploy open source models using DeepStream and Triton Inference Server☆86Jun 27, 2024Updated last year
- ☆25Oct 10, 2022Updated 3 years ago
- Implementation of End-to-End YOLO Models☆10Dec 30, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Triton Inferece Server Model Config and Client Scripts☆31Jan 7, 2022Updated 4 years ago
- Generic data structures and utility types in Go☆28Updated this week
- Uploads Frigate clips to Google Drive using MQTT & Frigate API☆12May 19, 2026Updated 3 weeks ago
- PPE detection of helmets(construction) using Nvidia Deepstream. Model trained using Nvidia TLT.☆11Jun 27, 2021Updated 4 years ago
- Simple example of FastAPI + Celery + Triton for benchmarking☆65Aug 11, 2022Updated 3 years ago
- ☆11Apr 13, 2019Updated 7 years ago
- Transformer related optimization, including BERT, GPT☆14Jun 27, 2023Updated 2 years ago
- ROI-based Instance Segmentation for Human Detection (CNN)☆23Oct 7, 2025Updated 8 months ago
- Using TensorRT and Triton Server to build BERT model as a service☆13Jan 10, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for ECCV 2020 paper (Spotlight) "Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object …☆12Apr 22, 2021Updated 5 years ago
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 4 years ago
- 支持RTMDet、YOLOv8、YOLOX、Faster R-CNN等常见算法的ncnn部署☆13Mar 17, 2024Updated 2 years ago
- StrongSORT with Selective Feature Extraction Mechanism☆16Sep 25, 2024Updated last year
- ☆11Oct 17, 2023Updated 2 years ago
- Accelerates creation of an inventory management solution for product manufactures and suppliers☆17Jul 9, 2023Updated 2 years ago
- Cattle identification is a major problem in classical animal identification methods ( ear-tagging, tattoo, freeze branding, and embeded …☆15Nov 29, 2018Updated 7 years ago
- Implementation of the content-aware image resizing algorithm presented in the paper "Seam carving for content-aware image resizing"☆13Jul 22, 2019Updated 6 years ago
- At the point when we started this project, election week is coming up. There was so much excitement in the air on who is the next US pres…☆14Dec 13, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13May 11, 2021Updated 5 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- An agent that can run everywhere - even in your watch!☆33Apr 8, 2026Updated 2 months ago
- Student version of Mini-SLAM.☆10Mar 16, 2024Updated 2 years ago
- ☆17Nov 23, 2023Updated 2 years ago
- DRFI For Region Dissection☆13Jan 11, 2019Updated 7 years ago
- Controlling the MicriSpotAI robot from scratch☆14Dec 30, 2021Updated 4 years ago
- INT8 calibrator for ONNX model with dynamic batch_size at the input and NMS module at the output. C++ Implementation.☆18Oct 15, 2024Updated last year
- YOLOv10: Real-Time End-to-End Object Detection☆17Sep 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆144Jun 9, 2026Updated last week
- deepstream python play rtsp h264☆14Jan 10, 2022Updated 4 years ago
- Elixir-based Event Source server-side implementation using Phoenix Pubsub☆18Nov 25, 2020Updated 5 years ago
- An ncnn-vulkan implementation of YOLOv5, capable of using GPU to accelerate inference☆14Nov 18, 2021Updated 4 years ago
- FastAPI middleware for comparing different ML model serving approaches☆15Jul 5, 2023Updated 2 years ago
- Compare Savant and PyTorch performance☆13Feb 9, 2024Updated 2 years ago