microsoft / msccl
Microsoft Collective Communication Library
☆343Updated last year
Alternatives and similar repositories for msccl:
Users that are interested in msccl are comparing it to the libraries listed below
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆311Updated this week
- NCCL Profiling Kit☆127Updated 8 months ago
- Synthesizer for optimal collective communication algorithms☆106Updated 11 months ago
- RDMA and SHARP plugins for nccl library☆183Updated last month
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆116Updated last year
- Microsoft Collective Communication Library☆60Updated 3 months ago
- ☆75Updated 2 years ago
- ☆131Updated last year
- A baseline repository of Auto-Parallelism in Training Neural Networks☆143Updated 2 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆71Updated last year
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆74Updated 4 years ago
- Shared Middle-Layer for Triton Compilation☆232Updated last week
- nnScaler: Compiling DNN models for Parallel Training☆101Updated last month
- A tool for bandwidth measurements on NVIDIA GPUs.☆391Updated last month
- Repository for MLCommons Chakra schema and tools☆92Updated last week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆132Updated this week
- Dynamic Memory Management for Serving LLMs without PagedAttention☆317Updated this week
- ☆330Updated 10 months ago
- ☆79Updated 3 months ago
- An experimental parallel training platform☆54Updated 11 months ago
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale☆327Updated 3 weeks ago
- collection of benchmarks to measure basic GPU capabilities☆308Updated last month
- An interference-aware scheduler for fine-grained GPU sharing☆127Updated last month
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆153Updated 5 months ago
- A validation and profiling tool for AI infrastructure☆302Updated this week
- Unified Collective Communication Library☆237Updated this week
- LLM serving cluster simulator☆93Updated 10 months ago
- Zero Bubble Pipeline Parallelism☆370Updated 2 weeks ago
- ROCm Communication Collectives Library (RCCL)☆305Updated this week