triton-inference-server / developer_tools
☆18 Updated this week
Alternatives and similar repositories for developer_tools:
Users who are interested in developer_tools are comparing it to the libraries listed below.
- The Triton backend for TensorRT. ☆68 Updated this week
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API. ☆132 Updated this week
- The Triton backend for the ONNX Runtime. ☆138 Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs. ☆196 Updated last month
- Model compression for ONNX ☆84 Updated 2 months ago
- Demonstration of the use of TensorRT and Triton ☆16 Updated 4 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files ☆23 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆16 Updated 8 months ago
- ☆30 Updated 2 years ago
- Wanwu models release, code will be released soon ☆24 Updated 2 years ago
- OneFlow->ONNX ☆42 Updated last year
- OpenVINO backend for Triton. ☆30 Updated this week
- ☆43 Updated this week
- ☆10 Updated 3 years ago
- The Triton backend for TensorFlow. ☆49 Updated this week
- ☆55 Updated last year
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆64 Updated 2 years ago
- Benchmark of TVM quantized model on CUDA ☆111 Updated 4 years ago
- Common source, scripts and utilities for creating Triton backends. ☆307 Updated this week
- The Triton backend for the PyTorch TorchScript models. ☆143 Updated this week
- ☆69 Updated last year
- ☆33 Updated last year
- ☆9 Updated 2 years ago
- This repository provides an optical character detection and recognition solution optimized on NVIDIA devices. ☆70 Updated this week
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen… ☆53 Updated this week
- ☆24 Updated 2 years ago
- Notes and artifacts from the ONNX steering committee ☆25 Updated last week
- ☆42 Updated 4 years ago
- ONNX Command-Line Toolbox ☆35 Updated 4 months ago