msr-fiddle / pipedreamLinks

☆393

Alternatives and similar repositories for pipedream

Users that are interested in pipedream are comparing it to the libraries listed below

Sorting:

AlibabaPAI / DAPPLE
An Efficient Pipelined Data Parallel Approach for Training Large Model
☆76Updated 4 years ago
microsoft / msccl
Microsoft Collective Communication Library
☆367Updated 2 years ago
SymbioticLab / Salus
Fine-grained GPU sharing primitives
☆144Updated 2 months ago
petuum / adaptdl
Resource-adaptive cluster scheduler for deep learning training.
☆448Updated 2 years ago
netx-repo / PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆126Updated 3 years ago
microsoft / nnfusion
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
☆994Updated last year
snuspl / nimble
Lightweight and Parallel Deep Learning Framework
☆264Updated 2 years ago
mit-han-lab / inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆199Updated 3 years ago
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆118Updated last year
alibaba / GPU-scheduler-for-deep-learning
GPU-scheduler-for-deep-learning
☆210Updated 4 years ago
ConnollyLeon / awesome-Auto-Parallelism
A baseline repository of Auto-Parallelism in Training Neural Networks
☆147Updated 3 years ago
pytorch / tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
☆273Updated 2 months ago
HPDL-Group / Merak
☆81Updated 5 months ago
kakaobrain / torchgpipe
A GPipe implementation in PyTorch
☆857Updated last year
stanford-mast / INFaaS
Model-less Inference Serving
☆92Updated last year
jiazhihao / TASO
The Tensor Algebra SuperOptimizer for Deep Learning
☆730Updated 2 years ago
lsds / KungFu
Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.
☆298Updated last year
uwsampl / nexus
☆83Updated 4 months ago
alibaba / EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
☆269Updated 2 years ago
stanford-futuredata / gavel
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆130Updated last year
tbd-ai / tbd-suite
☆47Updated 2 years ago
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆62Updated 2 years ago
petuum / autodist
Simple Distributed Deep Learning on TensorFlow
☆134Updated 4 months ago
google / nccl-fastsocket
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
☆121Updated last year
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆88Updated 2 years ago
calculon-ai / calculon
☆154Updated last year
linnanwang / superneurons-release
this is the release repository of superneurons
☆53Updated 4 years ago
baidu-research / baidu-allreduce
☆599Updated 7 years ago
msr-fiddle / philly-traces
☆195Updated 6 years ago
awslabs / raf
☆145Updated 8 months ago