Bobo-y / triton_ensemble_model_demo
triton server ensemble model demo
☆30Updated 3 years ago
Alternatives and similar repositories for triton_ensemble_model_demo
Users who are interested in triton_ensemble_model_demo are comparing it to the libraries listed below.
- ☆53Updated 3 years ago
- ☆53Updated 3 years ago
- Convert a YOLOv3 model into a TensorRT model that supports dynamic-batch TensorRT inference and deployment on Triton Inference Server☆28Updated 4 years ago
- This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server☆286Updated 3 years ago
- Compare multiple optimization methods on Triton to improve model service performance☆52Updated last year
- RetinaFace reaches 80.99% on the WIDER FACE hard validation set using MobileNet0.25.☆25Updated 5 years ago
- Implements YOLOv5 with the TensorRT C++ API and integrates batchedNMSPlugin. A Python wrapper is also provided.☆49Updated 3 years ago
- ☆25Updated 4 years ago
- Decode JPEG image on GPU using PyTorch☆91Updated last year
- TensorRT plugin for DCNv2 layer in ONNX model☆60Updated 4 years ago
- NVIDIA-Alibaba 2021 TRT competition `second prize` code submission. Team: 美迪康 AI Lab☆171Updated 3 years ago
- Deploy the YOLOX algorithm using DeepStream☆89Updated 3 years ago
- Using TensorRT for Inference Model Deployment.☆49Updated last year
- Face Recognition with RetinaFace and ArcFace.☆85Updated 3 years ago
- How to deploy open source models using DeepStream and Triton Inference Server☆81Updated last year
- Using GAN magic to generate more realistic license plates☆77Updated 5 years ago
- Utility scripts for editing or modifying onnx models. Utility scripts to summarize onnx model files along with visualization for loop ope…☆80Updated 3 years ago
- A project demonstrating how to use nvmetamux to run multiple models in parallel.☆104Updated 9 months ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆136Updated last week
- Convert ArcFace and RetinaFace models from MXNet to ONNX.☆61Updated 4 years ago
- A tensorrt implementation of yolov5: https://github.com/ultralytics/yolov5☆190Updated 4 years ago
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Updated 3 years ago
- psenet, prune model, text detection☆17Updated 5 years ago
- This repository provides a YOLOv5 GPU optimization sample☆106Updated 2 years ago
- centernet, mobilenetv2, centerface☆52Updated 5 years ago
- This is an 8-bit quantization sample for yolov5. PTQ, QAT and Partial Quantization have all been implemented, and present the results based…☆105Updated 3 years ago
- ☆63Updated 4 years ago
- A multi-object tracking library based on TensorRT☆54Updated 3 years ago
- A PyTorch-to-TensorRT converter with dynamic shape support☆263Updated last year
- ☆16Updated 3 years ago