triton-inference-server / triton_distributed
☆48Updated last month
Alternatives and similar repositories for triton_distributed:
Users that are interested in triton_distributed are comparing it to the libraries listed below
- NVIDIA Inference Xfer Library (NIXL)☆282Updated this week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆150Updated this week
- Efficient and easy multi-instance LLM serving☆383Updated this week
- NVIDIA NCCL Tests for Distributed Training☆88Updated this week
- A low-latency & high-throughput serving engine for LLMs☆346Updated this week
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.