triton-inference-server / common

Common source, scripts and utilities shared across all Triton repositories.

☆77

Alternatives and similar repositories for common

Users who are interested in common are comparing it to the repositories listed below.

triton-inference-server / backend
Common source, scripts and utilities for creating Triton backends.
☆353 Updated 3 weeks ago
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆163 Updated 3 weeks ago
triton-inference-server / core
The core library and APIs implementing the Triton Inference Server.
☆152 Updated this week
triton-inference-server / client
Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala.
☆654 Updated last week
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆79 Updated 3 weeks ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆495 Updated last week
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆213 Updated 6 months ago
triton-inference-server / tensorflow_backend
The Triton backend for TensorFlow.
☆53 Updated 4 months ago
triton-inference-server / python_backend
Triton backend that enables pre-processing, post-processing and other logic to be implemented in Python.
☆651 Updated 2 weeks ago
triton-inference-server / perf_analyzer
☆115 Updated 3 weeks ago
triton-inference-server / triton_cli
Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…
☆70 Updated 3 weeks ago
triton-inference-server / vllm_backend
☆302 Updated last week
triton-inference-server / pytorch_backend
The Triton backend for PyTorch TorchScript models.
☆162 Updated 2 weeks ago
triton-inference-server / dali_backend
The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's Python API.
☆137 Updated 2 weeks ago
microsoft / onnxconverter-common
Common utilities for ONNX converters
☆283 Updated last month
triton-inference-server / tutorials
This repository contains tutorials and examples for Triton Inference Server
☆792 Updated 3 weeks ago
triton-inference-server / pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
☆824 Updated 2 months ago
wangkuiyi / huggingface-tokenizer-in-cxx
☆69 Updated 2 years ago
triton-inference-server / paddlepaddle_backend
☆36 Updated last year
triton-inference-server / openvino_backend
OpenVINO backend for Triton.
☆34 Updated 3 weeks ago
microsoft / onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime
☆418 Updated last week
neuralmagic / AutoFP8
☆205 Updated 5 months ago
microsoft / onnxscript
ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.
☆404 Updated last week
mlc-ai / tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
☆400 Updated 2 months ago
leimao / Nsight-Compute-Docker-Image
Nsight Compute In Docker
☆12 Updated last year
inisis / OnnxSlim
A Toolkit to Help Optimize ONNX Models
☆228 Updated this week
npuichigo / openai_trtllm
OpenAI compatible API for TensorRT LLM triton backend
☆216 Updated last year
triton-inference-server / fastertransformer_backend
☆413 Updated last year
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆295 Updated last year
onnx / optimizer
ONNX Optimizer
☆768 Updated last week