mlsys-seo / ooo-backpropLinks

☆25

Alternatives and similar repositories for ooo-backprop

Users that are interested in ooo-backprop are comparing it to the libraries listed below

Sorting:

Sys-KU / DeepPlan
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆57Updated 3 months ago
VIA-Research / vTrain
☆73Updated 5 months ago
swsnu / aisys2023
☆103Updated 2 years ago
unist-ssl / IIDP
☆13Updated 7 months ago
casys-kaist / EnvPipe
☆25Updated 2 years ago
DachengLi1 / AMP
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆42Updated 3 years ago
saareliad / FTPipe
FTPipe and related pipeline model parallelism research.
☆43Updated 2 years ago
parasailteam / coconet
☆83Updated 2 years ago
ConstantPark / DL_Compiler
Study Group of Deep Learning Compiler
☆165Updated 2 years ago
snuspl / nimble
Lightweight and Parallel Deep Learning Framework
☆263Updated 2 years ago
friendliai / LLMServingPerfEvaluator
☆48Updated last year
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13Updated last year
zhuohan123 / terapipe
☆77Updated 4 years ago
swsnu / bd2018
☆24Updated 7 years ago
microsoft / msccl-tools
Synthesizer for optimal collective communication algorithms
☆119Updated last year
ParCIS / Chimera
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆68Updated 8 months ago
msr-fiddle / CheckFreq
☆57Updated 4 years ago
SymbioticLab / Oobleck
A resilient distributed training framework
☆96Updated last year
alpa-projects / mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
☆91Updated 2 years ago
mcrl / tccl
Thunder Research Group's Collective Communication Library
☆42Updated 4 months ago
casys-kaist / glet
☆53Updated 10 months ago
Azure / msccl
Microsoft Collective Communication Library
☆66Updated last year
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆152Updated 9 months ago
friendliai / periflow-cli
Welcome to PeriFlow CLI ☁︎
☆12Updated 2 years ago
casys-kaist / casys-kaist.github.io
☆18Updated 3 weeks ago
uclasystem / bamboo
Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
☆53Updated 2 years ago
HuaizhengZhang / MIGProfiler
Multi-Instance-GPU profiling tool
☆60Updated 2 years ago
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆62Updated 2 years ago
UofT-EcoSystem / hotline
☆32Updated 2 years ago
EMDC-OS / power-aware-triton
Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23)
☆13Updated 5 months ago