RDMA and SHARP plugins for nccl library
☆227 · Apr 3, 2026 · Updated last month
Alternatives and similar repositories for nccl-rdma-sharp-plugins
Users that are interested in nccl-rdma-sharp-plugins are comparing it to the libraries listed below.
- NCCL Profiling Kit ☆153 · Jul 1, 2024 · Updated last year
- ☆394 · Apr 23, 2024 · Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆122 · Nov 15, 2023 · Updated 2 years ago
- pytorch ucc plugin ☆23 · Jul 8, 2021 · Updated 4 years ago
- ☆26 · May 19, 2021 · Updated 4 years ago
- Microsoft Collective Communication Library ☆389 · Sep 20, 2023 · Updated 2 years ago
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,373 · Mar 12, 2026 · Updated last month
- Synthesizer for optimal collective communication algorithms ☆123 · Apr 8, 2024 · Updated 2 years ago
- NCCL Tests ☆1,505 · Apr 13, 2026 · Updated 3 weeks ago
- Optimized primitives for collective multi-GPU communication ☆4,656 · Apr 28, 2026 · Updated last week
- Unified Collective Communication Library ☆306 · Apr 22, 2026 · Updated 2 weeks ago
- ☆47 · Dec 13, 2024 · Updated last year
- A tool for bandwidth measurements on NVIDIA GPUs. ☆689 · Apr 8, 2026 · Updated 3 weeks ago
- GPUDirect Async support for IB Verbs ☆137 · Nov 10, 2022 · Updated 3 years ago
- oneAPI Collective Communications Library (oneCCL) ☆264 · Apr 23, 2026 · Updated last week
- Infiniband Verbs Performance Tests ☆952 · Apr 15, 2026 · Updated 3 weeks ago
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications. ☆211 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆418 · Apr 28, 2026 · Updated last week
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches ☆80 · Jul 25, 2023 · Updated 2 years ago
- gossip: Efficient Communication Primitives for Multi-GPU Systems ☆62 · Jul 1, 2022 · Updated 3 years ago
- MSCCL++: A GPU-driven communication stack for scalable AI applications ☆507 · Updated this week
- ☆84 · Dec 2, 2022 · Updated 3 years ago
- Python bindings for UCX ☆139 · Sep 18, 2025 · Updated 7 months ago
- Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group) ☆1,626 · Updated this week
- Venus Collective Communication Library, supported by SII and Infrawaves. ☆144 · Apr 22, 2026 · Updated 2 weeks ago
- Collective communications library with various primitives for multi-machine training. ☆1,422 · Apr 21, 2026 · Updated 2 weeks ago
- Thunder Research Group's Collective Communication Library ☆52 · Jul 8, 2025 · Updated 9 months ago
- RDMA core userspace libraries and daemons ☆2,213 · Apr 20, 2026 · Updated 2 weeks ago
- Pytorch process group third-party plugin for UCC ☆21 · Apr 15, 2024 · Updated 2 years ago
- ☆26 · Feb 17, 2025 · Updated last year
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory ☆155 · Jul 30, 2024 · Updated last year
- NVIDIA Inference Xfer Library (NIXL) ☆1,011 · Updated this week
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025 ☆31 · Oct 22, 2025 · Updated 6 months ago
- ☆54 · Feb 1, 2026 · Updated 3 months ago
- NVIDIA NCCL Tests for Distributed Training ☆144 · Apr 29, 2026 · Updated last week
- A tutorial on RDMA based programming using code examples ☆617 · Jan 3, 2020 · Updated 6 years ago
- verbs profiling library ☆22 · Sep 22, 2023 · Updated 2 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆884 · Sep 26, 2025 · Updated 7 months ago
- A fast communication-overlapping library for tensor/expert parallelism on GPUs. ☆1,297 · Aug 28, 2025 · Updated 8 months ago