facebookresearch / nccl
Optimized primitives for collective multi-GPU communication
☆21 Updated 11 months ago
Alternatives and similar repositories for nccl:
Users who are interested in nccl are comparing it to the libraries listed below.
- NCCL Profiling Kit ☆129 Updated 9 months ago
- Unified Collective Communication Library ☆246 Updated this week
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud. ☆116 Updated last year
- PyTorch process group third-party plugin for UCC ☆20 Updated 11 months ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for… ☆134 Updated this week
- GPUDirect Async support for IB Verbs ☆109 Updated 2 years ago
- PyTorch UCC plugin ☆21 Updated 3 years ago
- RDMA and SHARP plugins for the NCCL library ☆187 Updated this week
- RCCL Performance Benchmark Tests ☆60 Updated this week
- This is a plugin which lets EC2 developers use libfabric as a network provider while running NCCL applications. ☆168 Updated this week
- Multi-GPU communication profiler and visualizer ☆28 Updated 10 months ago
- A hierarchical collective communications library with portable optimizations ☆33 Updated 4 months ago
- oneAPI Collective Communications Library (oneCCL) ☆232 Updated last week
- Synthesizer for optimal collective communication algorithms ☆105 Updated last year
- ☆36 Updated 4 months ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs) ☆39 Updated this week
- GVProf: A Value Profiler for GPU-based Clusters ☆49 Updated last year
- ☆20 Updated 3 weeks ago
- ROCm Communication Collectives Library (RCCL) ☆317 Updated this week
- Microsoft Collective Communication Library ☆65 Updated 4 months ago
- Microsoft Collective Communication Library ☆342 Updated last year
- ☆23 Updated 3 years ago
- A command line utility to manage the configuration of a system's high performance network interfaces for RoCE deployments ☆29 Updated last year
- CloudAI Benchmark Framework ☆60 Updated this week
- NVIDIA Resiliency Extension is a Python package for framework developers and users to implement fault-tolerant features. It improves the … ☆139 Updated last week
- ☆58 Updated 2 months ago
- Magnum IO community repo ☆89 Updated 2 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆141 Updated last week
- DeepSeek-V3/R1 inference performance simulator ☆106 Updated 2 weeks ago
- Repository for MLCommons Chakra schema and tools ☆95 Updated last month