triton-inference-server / coreLinks

The core library and APIs implementing the Triton Inference Server.

☆152

Alternatives and similar repositories for core

Users that are interested in core are comparing it to the libraries listed below

Sorting:

triton-inference-server / backend
Common source, scripts and utilities for creating Triton backends.
☆352Updated 2 weeks ago
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆162Updated 2 weeks ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆495Updated this week
triton-inference-server / common
Common source, scripts and utilities shared across all Triton repositories.
☆76Updated last week
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆213Updated 6 months ago
triton-inference-server / perf_analyzer
☆115Updated 2 weeks ago
triton-inference-server / client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
☆654Updated last week
NVIDIA-Merlin / HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…
☆175Updated this week
bytedance / InfiniStore
KV cache store for distributed LLM inference
☆346Updated last month
triton-inference-server / python_backend
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
☆648Updated 2 weeks ago
ai-dynamo / nixl
NVIDIA Inference Xfer Library (NIXL)
☆688Updated this week
triton-inference-server / tensorrt_backend
The Triton backend for TensorRT.
☆79Updated 2 weeks ago
DeepRec-AI / HybridBackend
A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster
☆159Updated last year
alibaba / TePDist
TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.
☆97Updated 2 years ago
NVIDIA / DCGM
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
☆601Updated 2 weeks ago
triton-inference-server / triton_cli
Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…
☆70Updated 2 weeks ago
bytedance / ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…
☆265Updated 2 months ago
triton-inference-server / fastertransformer_backend
☆413Updated last year
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆358Updated this week
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆162Updated last week
AlibabaPAI / torchacc
PyTorch distributed training acceleration framework
☆53Updated 2 months ago
triton-inference-server / vllm_backend
☆302Updated this week
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆121Updated last year
triton-inference-server / hugectr_backend
☆56Updated 2 years ago
bytedance / ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479Updated last year
Oneflow-Inc / DLPerf
DeepLearning Framework Performance Profiling Toolkit
☆292Updated 3 years ago
triton-inference-server / tensorflow_backend
The Triton backend for TensorFlow.
☆53Updated 4 months ago
mlc-ai / tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
☆400Updated 2 months ago
alibaba / EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
☆269Updated 2 years ago
NVIDIA / nvbandwidth
A tool for bandwidth measurements on NVIDIA GPUs.
☆553Updated 6 months ago