Bobo-y / triton_ensemble_model_demo
triton server ensemble model demo
☆30Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for triton_ensemble_model_demo
- ☆53Updated 2 years ago
- ☆53Updated 2 years ago
- Compare multiple optimization methods on triton to imporve model service performance☆46Updated 10 months ago
- ☆21Updated 2 years ago
- ☆16Updated 2 years ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆32Updated 3 years ago
- Retinaface get 80.99% in widerface hard val using mobilenet0.25.☆22Updated 4 years ago
- ☆25Updated 3 years ago
- 将Yolov3模型转成可以进行动态Batch的TensorRT推理以及Triton Inference Serving上部署的TensorRT模型☆27Updated 3 years ago
- Implement yolov5 with Tensorrt C++ api, and integrate batchedNMSPlugin. A Python wrapper is also provided.☆50Updated 3 years ago
- A project demonstrating how to use nvmetamux to run multiple models in parallel.☆95Updated last month
- How to deploy open source models using DeepStream and Triton Inference Server☆74Updated 4 months ago
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆28Updated last year
- TensorRT plugin forDCNv2 layer in ONNX model☆58Updated 4 years ago
- YOLO v5 Object Detection on Triton Inference Server☆14Updated last year
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server☆279Updated 2 years ago
- A multi object tracking Library Based on tensorrt☆52Updated 3 years ago
- resize image in (CUDA, python, cupy)☆38Updated last year
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆82Updated 2 weeks ago
- deploy yolox algorithm use deepstream☆89Updated 2 years ago
- Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'☆94Updated last year
- Mobile Detection Benchmark☆44Updated 2 years ago
- This repository provides YOLOV5 GPU optimization sample☆100Updated last year
- Magface Triton Inferece Server Using Tensorrt☆15Updated 2 years ago
- NVIDIA-阿里2021 TRT比赛 `二等奖` 代码提交 团队:美迪康 AI Lab☆164Updated 2 years ago
- async inference for machine learning model☆26Updated 2 years ago
- ☆24Updated 3 years ago
- Using TensorRT for Inference Model Deployment.☆47Updated 10 months ago