Mellanox / nccl-rdma-sharp-pluginsLinks

RDMA and SHARP plugins for nccl library

☆199

Alternatives and similar repositories for nccl-rdma-sharp-plugins

Users that are interested in nccl-rdma-sharp-plugins are comparing it to the libraries listed below

Sorting:

google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆118Updated last year
Mellanox / nv_peer_memory
☆361Updated last year
microsoft / NPKit
NCCL Profiling Kit
☆139Updated last year
microsoft / msccl
Microsoft Collective Communication Library
☆352Updated last year
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆110Updated last year
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆147Updated last week
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆143Updated this week
gpudirect / libgdsync
GPUDirect Async support for IB Verbs
☆128Updated 2 years ago
Mellanox / gpu_direct_rdma_access
example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory
☆137Updated last year
aws / aws-ofi-nccl
This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
☆181Updated last week
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆390Updated this week
NVIDIA / nvbandwidth
A tool for bandwidth measurements on NVIDIA GPUs.
☆492Updated 3 months ago
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆210Updated 4 years ago
openucx / ucc
Unified Collective Communication Library
☆262Updated this week
microsoft / superbenchmark
A validation and profiling tool for AI infrastructure
☆325Updated last week
mlcommons / chakra
Repository for MLCommons Chakra schema and tools
☆114Updated last month
uccl-project / uccl
Ultra and Unified CCL
☆440Updated this week
Azure / msccl-executor-nccl
☆37Updated 7 months ago
facebookresearch / torch_ucc
Pytorch process group third-party plugin for UCC
☆21Updated last year
Azure / msccl
Microsoft Collective Communication Library
☆63Updated 8 months ago
uxlfoundation / oneCCL
oneAPI Collective Communications Library (oneCCL)
☆241Updated 3 weeks ago
coreweave / nccl-tests
NVIDIA NCCL Tests for Distributed Training
☆100Updated last week
Bruce-Lee-LY / cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
☆160Updated last year
ai-dynamo / nixl
NVIDIA Inference Xfer Library (NIXL)
☆491Updated this week
microsoft / taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆74Updated 2 years ago
mcrl / tccl
Thunder Research Group's Collective Communication Library
☆39Updated 3 weeks ago
openucx / xccl
☆25Updated 4 years ago
NVIDIA / MagnumIO
Magnum IO community repo
☆95Updated 2 months ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127Updated 3 years ago
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆60Updated last year