triton-inference-server / paddlepaddle_backendLinks

☆36

Alternatives and similar repositories for paddlepaddle_backend

Users that are interested in paddlepaddle_backend are comparing it to the libraries listed below

Sorting:

torchpipe / torchpipe
Serving Inside Pytorch
☆165Updated 2 weeks ago
bug-developer021 / YOLOV5_optimization_on_triton
Compare multiple optimization methods on triton to imporve model service performance
☆52Updated last year
triton-inference-server / dali_backend
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
☆139Updated 3 weeks ago
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆79Updated 3 weeks ago
TRT2022 / trtllm-llama
☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化
☆50Updated 2 years ago
triton-inference-server / backend
Common source, scripts and utilities for creating Triton backends.
☆361Updated 3 weeks ago
PaddlePaddle / Paddle-Inference-Demo
☆268Updated 2 weeks ago
FeiGeChuanShu / trt2023
NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆43Updated 2 years ago
modelbox-ai / modelbox
A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架，快速基于AI全栈服务、开发跨端边云的AI行业应用，支持GPU，…
☆160Updated last year
MACNICA-CLAVIS-NV / yolov5-triton
YOLO v5 Object Detection on Triton Inference Server
☆16Updated 2 years ago
Tencent / TPAT
TensorRT Plugin Autogen Tool
☆368Updated 2 years ago
PaddlePaddle / PLSC
Paddle Large Scale Classification Tools，supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT,…
☆155Updated 2 years ago
inisis / OnnxLLM
Large Language Model Onnx Inference Framework
☆36Updated last week
PaddlePaddle / benchmark
☆80Updated last month
PaddlePaddle / community
PaddlePaddle Developer Community
☆127Updated last week
triton-inference-server / client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
☆665Updated last week
Oneflow-Inc / oneflow_convert
OneFlow->ONNX
☆43Updated 2 years ago
tsingmicro-toolchain / OnnxSlim
A Toolkit to Help Optimize Large Onnx Model
☆162Updated last month
BBuf / onnx_learn
☆102Updated 4 years ago
zzk0 / triton
Triton Inferece Server Model Config and Client Scripts
☆32Updated 3 years ago
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆213Updated 7 months ago
PaddlePaddle / PaddleCustomDevice
PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)
☆100Updated last week
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆168Updated this week
Bobo-y / triton_ensemble_model_demo
triton server ensemble model demo
☆30Updated 3 years ago
Oldpan / DeployIsAllYouNeed
☆120Updated 2 years ago
isarsoft / yolov4-triton-tensorrt
This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server
☆286Updated 3 years ago
triton-inference-server / tensorflow_backend
The Triton backend for TensorFlow.
☆55Updated 2 weeks ago
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆287Updated 3 months ago
Tlntin / trt2023
☆26Updated 2 years ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆499Updated last week