triton-inference-server / developer_toolsLinks

☆21

Alternatives and similar repositories for developer_tools

Users that are interested in developer_tools are comparing it to the libraries listed below

Sorting:

triton-inference-server / dali_backend
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
☆139Updated last week
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆79Updated last week
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆162Updated last week
meta-pytorch / tokenizers
C++ implementations for various tokenizers (sentencepiece, tiktoken etc).
☆36Updated last week
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆212Updated 5 months ago
triton-inference-server / tensorflow_backend
The Triton backend for TensorFlow.
☆53Updated 4 months ago
onnx / neural-compressor
Model compression for ONNX
☆97Updated 11 months ago
triton-inference-server / fil_backend
FIL backend for the Triton Inference Server
☆83Updated last week
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆282Updated last month
triton-inference-server / openvino_backend
OpenVINO backend for Triton.
☆34Updated last week
triton-inference-server / common
Common source, scripts and utilities shared across all Triton repositories.
☆76Updated last week
triton-inference-server / backend
Common source, scripts and utilities for creating Triton backends.
☆351Updated last week
scailable / sclblonnx
Scailable ONNX python tools
☆97Updated 11 months ago
gmalivenko / onnx-opcounter
Count number of parameters / MACs / FLOPS for ONNX models.
☆94Updated 11 months ago
NVIDIA-AI-IOT / NVIDIA-Optical-Character-Detection-and-Recognition-Solution
This repository provides optical character detection and recognition solution optimized on Nvidia devices.
☆81Updated 5 months ago
lucasjinreal / wanwu_release
Wanwu models release, code will be released soon
☆24Updated 3 years ago
triton-inference-server / paddlepaddle_backend
☆36Updated last year
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆160Updated this week
YH-Wu / Triton-Inference-Server-on-Kubernetes
☆33Updated 3 years ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆494Updated last week
meta-pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆180Updated last month
sdpython / mlprodict
Productionize machine learning predictions, with ONNX or without
☆66Updated last year
k9ele7en / Triton-TensorRT-Inference-CRAFT-pytorch
Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…
☆33Updated 4 years ago
NVIDIA / nvImageCodec
A nvImageCodec library of GPU- and CPU- accelerated codecs featuring a unified interface
☆119Updated 2 months ago
triton-inference-server / triton_cli
Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…
☆70Updated last week
NVIDIA-AI-IOT / tao-toolkit-triton-apps
Sample app code for deploying TAO Toolkit trained models to Triton
☆89Updated last year
mlflow / mlflow-torchserve
Plugin for deploying MLflow models to TorchServe
☆110Updated 2 years ago
NVIDIA-AI-IOT / deepstream_triton_model_deploy
How to deploy open source models using DeepStream and Triton Inference Server
☆85Updated last year
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆114Updated 2 years ago
pytorch / ort
Accelerate PyTorch models with ONNX Runtime
☆365Updated 7 months ago