ai-dynamo / nixlLinks
NVIDIA Inference Xfer Library (NIXL)
☆413Updated this week
Alternatives and similar repositories for nixl
Users that are interested in nixl are comparing it to the libraries listed below
Sorting:
- KV cache store for distributed LLM inference☆261Updated last week
- Perplexity GPU Kernels☆364Updated last week
- Efficient and easy multi-instance LLM serving☆430Updated last week
- ☆49Updated 3 months ago
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆378Updated this week
- Dynamic Memory Management for Serving LLMs without PagedAttention☆396Updated 2 weeks ago
- A low-latency & high-throughput serving engine for LLMs☆379Updated 2 weeks ago
- Disaggregated serving system for Large Language Models (LLMs).☆614Updated 2 months ago
- Distributed Compiler Based on Triton for Parallel Systems☆829Updated this week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆177Updated last week
- Zero Bubble Pipeline Parallelism☆397Updated last month
- Ultra and Unified CCL☆154Updated this week
- CUDA checkpoint and restore utility☆343Updated 4 months ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆116Updated last year
- NCCL Profiling Kit☆137Updated 11 months ago
- Microsoft Collective Communication Library☆349Updated last year
- DeepSeek-V3/R1 inference performance simulator☆148Updated 2 months ago
- A tool for bandwidth measurements on NVIDIA GPUs.☆459Updated 2 months ago
- GLake: optimizing GPU memory management and IO transmission.☆467Updated 2 months ago
- Materials for learning SGLang☆435Updated 2 weeks ago
- ☆91Updated 5 months ago
- RDMA and SHARP plugins for nccl library☆195Updated last week
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆85Updated last month
- A throughput-oriented high-performance serving framework for LLMs☆822Updated 2 weeks ago
- A library to analyze PyTorch traces.☆387Updated last week
- ☆26Updated 3 months ago
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆245Updated last week
- High performance Transformer implementation in C++.☆125Updated 5 months ago
- A PyTorch Native LLM Training Framework☆819Updated 5 months ago
- NVIDIA NCCL Tests for Distributed Training☆93Updated last week