microsoft / msccl-tools

Synthesizer for optimal collective communication algorithms

☆118

Alternatives and similar repositories for msccl-tools

Users who are interested in msccl-tools are comparing it to the libraries listed below.

microsoft / msccl
Microsoft Collective Communication Library
☆367 Updated 2 years ago
microsoft / taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
☆76 Updated 2 years ago
microsoft / NPKit
NCCL Profiling Kit
☆145 Updated last year
parasailteam / coconet
☆83 Updated 2 years ago
microsoft / SuperScaler
An experimental parallel training platform
☆54 Updated last year
mlcommons / chakra
Repository for MLCommons Chakra schema and tools
☆131 Updated 3 weeks ago
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆88 Updated 2 years ago
Azure / msccl
Microsoft Collective Communication Library
☆66 Updated 10 months ago
calculon-ai / calculon
☆154 Updated last year
AlibabaPAI / DAPPLE
An Efficient Pipelined Data Parallel Approach for Training Large Model
☆76 Updated 4 years ago
eniac / paella
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
☆62 Updated last year
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆101 Updated 2 years ago
mutinifni / splitwise-sim
LLM serving cluster simulator
☆116 Updated last year
casys-kaist / glet
☆52 Updated 9 months ago
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆144 Updated 2 months ago
Mellanox / nccl-rdma-sharp-plugins
RDMA and SHARP plugins for nccl library
☆209 Updated last month
astra-sim / tacos
TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning
☆27 Updated 4 months ago
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆147 Updated 8 months ago
casys-kaist / HUVM
☆24 Updated 3 years ago
mcrl / tccl
Thunder Research Group's Collective Communication Library
☆42 Updated 3 months ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆126 Updated 3 years ago
HPDL-Group / Merak
☆81 Updated 5 months ago
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆153 Updated last week
astra-sim / astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
☆446 Updated last month
ConnollyLeon / awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
☆147 Updated 3 years ago
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆121 Updated last year
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆46 Updated last year
microsoft / mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
☆425 Updated this week
Raphael-Hao / Abacus
☆38 Updated 3 months ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆53 Updated last year