petuum/adaptdl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/petuum/adaptdl)

petuum / adaptdl

Resource-adaptive cluster scheduler for deep learning training.

☆459

Alternatives and similar repositories for adaptdl

Users that are interested in adaptdl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

petuum / autodist
View on GitHub
Simple Distributed Deep Learning on TensorFlow
☆136Feb 5, 2026Updated 5 months ago
stanford-futuredata / gavel
View on GitHub
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139Jul 25, 2024Updated last year
petuum / tuun
View on GitHub
Hyperparameter tuning via uncertainty modeling
☆51May 3, 2024Updated 2 years ago
alibaba / GPU-scheduler-for-deep-learning
View on GitHub
GPU-scheduler-for-deep-learning
☆215Nov 5, 2020Updated 5 years ago
msr-fiddle / philly-traces
View on GitHub
☆199Aug 31, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
S-Lab-System-Group / Lucid
View on GitHub
Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs
☆61May 21, 2023Updated 3 years ago
SymbioticLab / Tiresias
View on GitHub
Tiresias is a GPU cluster manager for distributed deep learning training.
☆166May 7, 2020Updated 6 years ago
microsoft / hivedscheduler
View on GitHub
Kubernetes Scheduler for Deep Learning
☆263May 22, 2022Updated 4 years ago
S-Lab-System-Group / Awesome-DL-Scheduling-Papers
View on GitHub
☆332Jan 22, 2024Updated 2 years ago
S-Lab-System-Group / HeliosArtifact
View on GitHub
HeliosArtifact
☆22Sep 27, 2022Updated 3 years ago
pengyanghua / optimus
View on GitHub
A Deep Learning Cluster Scheduler
☆36Jan 11, 2021Updated 5 years ago
zhisbug / Cavs
View on GitHub
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆15Sep 18, 2020Updated 5 years ago
S-Lab-System-Group / HeliosData
View on GitHub
Helios Traces from SenseTime
☆63Sep 27, 2022Updated 3 years ago
S-Lab-System-Group / ChronusArtifact
View on GitHub
☆23Jan 7, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
SymbioticLab / Salus
View on GitHub
Fine-grained GPU sharing primitives
☆149Jul 28, 2025Updated 11 months ago
microsoft / elasticflow-traces
View on GitHub
Integrated Training Platform (ITP) traces used in ElasticFlow paper.
☆31Dec 23, 2022Updated 3 years ago
msr-fiddle / blox
View on GitHub
☆46Jul 4, 2024Updated 2 years ago
siasosp23 / artifacts
View on GitHub
☆24Aug 15, 2023Updated 2 years ago
gudiandian / ElasticFlow
View on GitHub
☆17May 10, 2024Updated 2 years ago
msr-fiddle / synergy
View on GitHub
☆54Dec 13, 2022Updated 3 years ago
pkusys / ElasticFlow
View on GitHub
Artifacts for our ASPLOS'23 paper ElasticFlow
☆56May 10, 2024Updated 2 years ago
Rivendile / Muri
View on GitHub
Artifacts for our SIGCOMM'22 paper Muri
☆44Dec 29, 2023Updated 2 years ago
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
elasticdeeplearning / edl
View on GitHub
Elastic Deep Learning for deep learning framework on Kubernetes
☆176Jul 5, 2023Updated 3 years ago
lsds / KungFu
View on GitHub
Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.
☆295Feb 23, 2024Updated 2 years ago
sql-machine-learning / elasticdl
View on GitHub
Kubernetes-native Deep Learning Framework
☆744Jan 26, 2024Updated 2 years ago
pengyanghua / DL2
View on GitHub
a deep learning-driven scheduler for elastic training in deep learning clusters
☆31Jan 14, 2021Updated 5 years ago
kzhang28 / Optimus
View on GitHub
An Efficient Dynamic Resource Scheduler for Deep Learning Clusters
☆41Oct 28, 2017Updated 8 years ago
Raphael-Hao / Abacus
View on GitHub
☆38Jun 27, 2025Updated last year
ray-project / ray_shuffling_data_loader
View on GitHub
A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…
☆18Jan 5, 2023Updated 3 years ago
geoffxy / habitat
View on GitHub
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆63Nov 26, 2022Updated 3 years ago
alpa-projects / alpa
View on GitHub
Training and serving large-scale neural networks with auto parallelization.
☆3,178Dec 9, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
uw-mad-dash / shockwave
View on GitHub
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆46Nov 24, 2022Updated 3 years ago
asyml / stave
View on GitHub
An extensible framework for building visualization and annotation tools to enable better interaction with NLP and Artificial Intelligence…
☆51Feb 4, 2023Updated 3 years ago
thu-pacman / PET
View on GitHub
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆126Jun 23, 2022Updated 4 years ago
SymbioticLab / ModelKeeper
View on GitHub
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆36Jan 9, 2023Updated 3 years ago
alibaba / clusterdata
View on GitHub
cluster data collected from production clusters in Alibaba for cluster management research
☆2,115Jun 3, 2026Updated last month
microsoft / varuna
View on GitHub
☆250Jul 25, 2024Updated last year
yylin1 / papers-notebook-with-scheduling
View on GitHub
碩士論文文獻筆記（Deep Learning、Scheduling、Distributed、Kubernetes）
☆51May 5, 2019Updated 7 years ago