zhuohan123/terapipe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhuohan123/terapipe)

zhuohan123 / terapipe

☆78

Alternatives and similar repositories for terapipe

Users that are interested in terapipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

saareliad / FTPipe
View on GitHub
FTPipe and related pipeline model parallelism research.
☆44May 16, 2023Updated 3 years ago
microsoft / SuperScaler
View on GitHub
An experimental parallel training platform
☆57Mar 25, 2024Updated 2 years ago
DachengLi1 / AMP
View on GitHub
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆44Nov 4, 2022Updated 3 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
HPDL-Group / Merak
View on GitHub
☆86Feb 11, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sail-sg / zero-bubble-pipeline-parallelism
View on GitHub
Zero Bubble Pipeline Parallelism
☆459May 7, 2025Updated last year
AlibabaPAI / DAPPLE
View on GitHub
An Efficient Pipelined Data Parallel Approach for Training Large Model
☆76Dec 11, 2020Updated 5 years ago
RulinShao / FastCkpt
View on GitHub
Python package for rematerialization-aware gradient checkpointing
☆27Oct 31, 2023Updated 2 years ago
msr-fiddle / dnn-partitioning
View on GitHub
☆42Oct 12, 2020Updated 5 years ago
parasailteam / coconet
View on GitHub
☆85Dec 2, 2022Updated 3 years ago
llumnix-project / llumnix-ray
View on GitHub
Efficient and easy multi-instance LLM serving
☆559Mar 12, 2026Updated 3 months ago
zhuzilin / pytorch-malloc
View on GitHub
An external memory allocator example for PyTorch.
☆16Aug 10, 2025Updated 10 months ago
feifeibear / long-context-attention
View on GitHub
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
☆675May 21, 2026Updated last month
octoml / synr
View on GitHub
A library for syntactically rewriting Python programs, pronounced (sinner).
☆66Feb 22, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mit-han-lab / inter-operator-scheduler
View on GitHub
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆200Apr 27, 2022Updated 4 years ago
microsoft / nnscaler
View on GitHub
nnScaler: Compiling DNN models for Parallel Training
☆132Jun 10, 2026Updated 3 weeks ago
marius-team / marius
View on GitHub
Large scale graph learning on a single machine.
☆167Feb 25, 2025Updated last year
RulinShao / LightSeq
View on GitHub
Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training
☆223Aug 19, 2024Updated last year
SymbioticLab / Salus
View on GitHub
Fine-grained GPU sharing primitives
☆149Jul 28, 2025Updated 11 months ago
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago
ParCIS / Chimera
View on GitHub
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆72Mar 20, 2025Updated last year
casys-kaist / EnvPipe
View on GitHub
☆27Aug 31, 2023Updated 2 years ago
xuqifan897 / Optimus
View on GitHub
☆28Jul 11, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
hku-systems / vpipe
View on GitHub
☆25Apr 3, 2023Updated 3 years ago
stanford-futuredata / gavel
View on GitHub
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139Jul 25, 2024Updated last year
Youhe-Jiang / IJCAI2023-OptimalShardedDataParallel
View on GitHub
[IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…
☆52May 31, 2023Updated 3 years ago
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
TonyTangYu / pytorch
View on GitHub
DELTA-pytorch：DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
☆12Apr 16, 2024Updated 2 years ago
PasaLab / Liquid
View on GitHub
Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters
☆16Nov 18, 2021Updated 4 years ago
Raphael-Hao / brainstorm
View on GitHub
Compiler for Dynamic Neural Networks
☆45Nov 13, 2023Updated 2 years ago
geoffxy / habitat
View on GitHub
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆63Nov 26, 2022Updated 3 years ago
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
spcl / substation
View on GitHub
Research and development for optimizing transformers
☆132Feb 16, 2021Updated 5 years ago
alpa-projects / alpa
View on GitHub
Training and serving large-scale neural networks with auto parallelization.
☆3,179Dec 9, 2023Updated 2 years ago
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
kakaobrain / torchgpipe
View on GitHub
A GPipe implementation in PyTorch
☆865Jul 25, 2024Updated last year
uclasystem / dorylus
View on GitHub
Dorylus: Affordable, Scalable, and Accurate GNN Training
☆76May 31, 2021Updated 5 years ago
AmadeusChan / Awesome-LLM-System-Papers
View on GitHub
☆645Jan 14, 2026Updated 5 months ago
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year