awslabs/optimizing-multitask-training-through-dynamic-pipelines

awslabs / optimizing-multitask-training-through-dynamic-pipelines

Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines

☆19

Alternatives and similar repositories for optimizing-multitask-training-through-dynamic-pipelines

Users who are interested in optimizing-multitask-training-through-dynamic-pipelines are comparing it to the libraries listed below.

bytedance / QSync
Official repository for "IPDPS'24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20 · Feb 23, 2024 · Updated 2 years ago
alibaba / hap
☆16 · Apr 13, 2024 · Updated 2 years ago
JF-D / Parcae
☆22 · Apr 22, 2024 · Updated 2 years ago
siasosp23 / artifacts
☆24 · Aug 15, 2023 · Updated 2 years ago
Raphael-Hao / brainstorm
Compiler for Dynamic Neural Networks
☆45 · Nov 13, 2023 · Updated 2 years ago
microsoft / nnscaler
nnScaler: Compiling DNN models for Parallel Training
☆135 · Jul 2, 2026 · Updated 3 weeks ago
raywan-110 / AdaQP
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training
☆24 · Mar 1, 2024 · Updated 2 years ago
thunlp / Seq1F1B
Sequence-level 1F1B schedule for LLMs.
☆37 · Aug 26, 2025 · Updated 10 months ago
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13 · Mar 7, 2024 · Updated 2 years ago
tonyzhao-jt / LLM-PQ
Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …
☆39 · Aug 29, 2025 · Updated 10 months ago
jasperzhong / GNNFlow
Distributed Deep Graph Learning Framework for Dynamic Graphs
☆19 · Mar 25, 2024 · Updated 2 years ago
SymbioticLab / Oobleck
A resilient distributed training framework
☆100 · Apr 11, 2024 · Updated 2 years ago
Aleph-Alpha-Research / NeurIPS-WANT-submission-efficient-parallelization-layouts
☆22 · Dec 15, 2023 · Updated 2 years ago
zhuohan123 / terapipe
☆79 · May 4, 2021 · Updated 5 years ago
pku-liang / MAGIS
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆57 · May 29, 2024 · Updated 2 years ago
casys-kaist / EnvPipe
☆27 · Aug 31, 2023 · Updated 2 years ago
SDS-Lab / QW_Loss
A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)
☆10 · May 20, 2024 · Updated 2 years ago
chenyu-jiang / dcp
Code repository for the SOSP'25 paper DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
☆21 · Nov 28, 2025 · Updated 7 months ago
ChandlerGuan / mercury_artifact
☆27 · Oct 1, 2025 · Updated 9 months ago
microsoft / SuperScaler
An experimental parallel training platform
☆57 · Mar 25, 2024 · Updated 2 years ago
liu445126256 / FuncPipe
☆11 · Jul 9, 2023 · Updated 3 years ago
harshanarayana / kube-scheduler
Custom Python Scheduler for Kubernetes
☆15 · Jan 25, 2020 · Updated 6 years ago
stanford-futuredata / gavel
Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139 · Jul 25, 2024 · Updated last year
saareliad / FTPipe
FTPipe and related pipeline model parallelism research.
☆44 · May 16, 2023 · Updated 3 years ago
PeterSH6 / MSPipe
☆16 · Feb 20, 2024 · Updated 2 years ago
MAC-AutoML / MotionCache
[ICML 2026]
☆17 · Jul 4, 2026 · Updated 2 weeks ago
meta-pytorch / remat
torch_remat fine-grained activation checkpointing API
☆14 · Updated this week
S-Lab-System-Group / Hydro
Surrogate-based Hyperparameter Tuning System
☆30 · Jun 29, 2023 · Updated 3 years ago
zrt / thu-xiyi
A mini program that notifies users when Tsinghua University dormitory washing machines are free
☆14 · Feb 4, 2021 · Updated 5 years ago
spcl / crosspipe
Official implementation of CrossPipe: Towards Optimal Pipeline Schedules for Cross-Datacenter Training (ATC '25), built on top of Megatro…
☆17 · Jul 6, 2025 · Updated last year
jasperzhong / swift
☆15 · Apr 20, 2022 · Updated 4 years ago
YerbaPage / plagiarism-certification-helper
A simple script to detect word-by-word plagiarism for https://plagiarism.iu.edu/certificationTests/
☆19 · Feb 22, 2024 · Updated 2 years ago
unist-ssl / IIDP
☆13 · Apr 7, 2025 · Updated last year
bertmaher / tf32_gemm
Example of binding a TF32 CUTLASS GEMM kernel to PyTorch
☆12 · Jun 7, 2024 · Updated 2 years ago
lsds / Tempo
Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning
☆30 · Oct 21, 2025 · Updated 9 months ago
msr-fiddle / synergy
☆54 · Dec 13, 2022 · Updated 3 years ago
Huntersxsx / SJTU_2020_Spring
A collection of selected graduate coursework assignments from Shanghai Jiao Tong University, Spring 2020
☆16 · Jun 14, 2020 · Updated 6 years ago
volcengine / veScale
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
☆1,031 · Mar 3, 2026 · Updated 4 months ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆56 · May 10, 2024 · Updated 2 years ago