saareliad/FTPipe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/saareliad/FTPipe)

saareliad / FTPipe

FTPipe and related pipeline model parallelism research.

☆44

Alternatives and similar repositories for FTPipe

Users that are interested in FTPipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhuohan123 / terapipe
View on GitHub
☆79May 4, 2021Updated 5 years ago
ParCoreLab / ComScribe
View on GitHub
ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.
☆27Jul 6, 2023Updated 3 years ago
casys-kaist / EnvPipe
View on GitHub
☆27Aug 31, 2023Updated 2 years ago
msr-fiddle / dnn-partitioning
View on GitHub
☆42Oct 12, 2020Updated 5 years ago
shriramsb / vDNN
View on GitHub
☆22Nov 7, 2018Updated 7 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
kazukiosawa / pipe-fisher
View on GitHub
☆10Apr 29, 2023Updated 3 years ago
parasailteam / coconet
View on GitHub
☆85Dec 2, 2022Updated 3 years ago
Sys-KU / DeepPlan
View on GitHub
[ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access
☆56Aug 6, 2025Updated 11 months ago
msr-fiddle / harmony
View on GitHub
☆17Dec 9, 2022Updated 3 years ago
AlibabaPAI / DAPPLE
View on GitHub
An Efficient Pipelined Data Parallel Approach for Training Large Model
☆76Dec 11, 2020Updated 5 years ago
zhuangwang93 / Espresso
View on GitHub
Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…
☆15Sep 21, 2023Updated 2 years ago
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
microsoft / SuperScaler
View on GitHub
An experimental parallel training platform
☆57Mar 25, 2024Updated 2 years ago
msr-fiddle / synergy
View on GitHub
☆54Dec 13, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
sara-nl / DDLBench
View on GitHub
Distributed Deep Learning Benchmark Suite
☆11Oct 31, 2022Updated 3 years ago
jasperzhong / swift
View on GitHub
☆15Apr 20, 2022Updated 4 years ago
msr-fiddle / pipedream
View on GitHub
☆394Nov 4, 2022Updated 3 years ago
stanford-futuredata / gavel
View on GitHub
Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020
☆139Jul 25, 2024Updated last year
mlsys-seo / ooo-backprop
View on GitHub
☆26Dec 5, 2022Updated 3 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
hpcaitech / Elixir
View on GitHub
Elixir: Train a Large Language Model on a Small GPU Cluster
☆16Jun 8, 2023Updated 3 years ago
msr-fiddle / DS-Analyzer
View on GitHub
☆39Jan 15, 2021Updated 5 years ago
hku-systems / naspipe
View on GitHub
☆14Jan 12, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
netx-repo / PipeSwitch
View on GitHub
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
☆127May 9, 2022Updated 4 years ago
Raphael-Hao / brainstorm
View on GitHub
Compiler for Dynamic Neural Networks
☆45Nov 13, 2023Updated 2 years ago
HPDL-Group / Merak
View on GitHub
☆86Feb 11, 2026Updated 5 months ago
uw-mad-dash / shockwave
View on GitHub
Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]
☆46Nov 24, 2022Updated 3 years ago
jashwantraj92 / cocktail
View on GitHub
☆16Aug 15, 2024Updated last year
DachengLi1 / AMP
View on GitHub
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆44Nov 4, 2022Updated 3 years ago
hku-systems / vpipe
View on GitHub
☆25Apr 3, 2023Updated 3 years ago
Distributed-AI / PipeTransformer
View on GitHub
PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models. ICML 2021
☆56Jul 21, 2021Updated 5 years ago
SJTU-IPADS / reef
View on GitHub
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆108Dec 24, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
xbfu / PyTorch-ParameterServer
View on GitHub
An implementation of parameter server framework in PyTorch RPC.
☆12Nov 12, 2021Updated 4 years ago
darchr / AutoTM
View on GitHub
Thinking is hard - automate it
☆18Aug 24, 2022Updated 3 years ago
SJTU-IPADS / disb
View on GitHub
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆58Aug 21, 2024Updated last year
anandj91 / p3
View on GitHub
☆21Nov 29, 2022Updated 3 years ago
microsoft / nnscaler
View on GitHub
nnScaler: Compiling DNN models for Parallel Training
☆135Jul 2, 2026Updated 2 weeks ago
alondj / Pytorch-Gpipe
View on GitHub
☆26Nov 13, 2019Updated 6 years ago
raywan-110 / AdaQP
View on GitHub
Adaptive Message Quantization and Parallelization for Distributed Full-graph GNN Training
☆24Mar 1, 2024Updated 2 years ago