pytorch/PiPPy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pytorch/PiPPy)

pytorch / PiPPy

Pipeline Parallelism for PyTorch

☆786

Alternatives and similar repositories for PiPPy

Users that are interested in PiPPy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pytorch / torchdistx
View on GitHub
Torch Distributed Experimental
☆117Aug 5, 2024Updated last year
pytorch / torchtitan
View on GitHub
A PyTorch native platform for training generative AI models
☆5,541Updated this week
pytorch / torchdynamo
View on GitHub
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,078Apr 17, 2024Updated 2 years ago
alpa-projects / alpa
View on GitHub
Training and serving large-scale neural networks with auto parallelization.
☆3,178Dec 9, 2023Updated 2 years ago
NVIDIA / TransformerEngine
View on GitHub
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆3,434Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / fairscale
View on GitHub
PyTorch extensions for high performance and large scale training.
☆3,410Apr 26, 2025Updated last year
facebookresearch / HolisticTraceAnalysis
View on GitHub
A library to analyze PyTorch traces.
☆535May 29, 2026Updated last month
meta-pytorch / torchsnapshot
View on GitHub
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆165Jun 10, 2026Updated last month
sail-sg / zero-bubble-pipeline-parallelism
View on GitHub
Zero Bubble Pipeline Parallelism
☆462May 7, 2025Updated last year
pytorch / kineto
View on GitHub
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆974Updated this week
pytorch / ao
View on GitHub
PyTorch native quantization and sparsity for training and inference
☆2,906Updated this week
zhuzilin / ring-flash-attention
View on GitHub
Ring attention implementation with flash attention
☆1,037Sep 10, 2025Updated 10 months ago
meta-pytorch / float8_experimental
View on GitHub
This repository contains the experimental PyTorch native float8 training UX
☆226Aug 1, 2024Updated last year
meta-pytorch / torchx
View on GitHub
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆427Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Azure / MS-AMP
View on GitHub
Microsoft Automatic Mixed Precision Library
☆636Dec 1, 2025Updated 7 months ago
deepspeedai / DeepSpeed-MII
View on GitHub
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
☆2,108Jun 30, 2025Updated last year
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,754May 26, 2026Updated last month
databricks / megablocks
View on GitHub
☆1,582Mar 25, 2026Updated 3 months ago
ELS-RD / kernl
View on GitHub
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…
☆1,585Jan 28, 2026Updated 5 months ago
flexflow / flexflow-train
View on GitHub
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
☆1,895Jul 1, 2026Updated 2 weeks ago
triton-lang / triton
View on GitHub
Development repository for the Triton language and compiler
☆19,725Updated this week
awslabs / slapo
View on GitHub
A schedule language for large model training
☆153Aug 21, 2025Updated 10 months ago
volcengine / veScale
View on GitHub
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
☆1,031Mar 3, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kakaobrain / torchgpipe
View on GitHub
A GPipe implementation in PyTorch
☆865Jul 25, 2024Updated last year
BlackSamorez / tensor_parallel
View on GitHub
Automatically split your PyTorch models on multiple GPUs for training & inference
☆655Jan 2, 2024Updated 2 years ago
foundation-model-stack / fms-fsdp
View on GitHub
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…
☆288Nov 24, 2025Updated 7 months ago
thuml / depyf
View on GitHub
depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.
☆815Oct 13, 2025Updated 9 months ago
hidet-org / hidet
View on GitHub
An open-source efficient deep learning framework/compiler, written in python.
☆743Sep 4, 2025Updated 10 months ago
awslabs / raf
View on GitHub
☆144Jan 30, 2025Updated last year
NVIDIA / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆6,439Mar 27, 2024Updated 2 years ago
NVIDIA / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆17,108Updated this week
pytorch / functorch
View on GitHub
functorch is JAX-like composable function transforms for PyTorch.
☆1,434Aug 21, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
feifeibear / long-context-attention
View on GitHub
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
☆681May 21, 2026Updated last month
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆5,983Updated this week
meta-pytorch / attention-gym
View on GitHub
Helpful tools and examples for working with flex-attention
☆1,209Updated this week
meta-pytorch / applied-ai
View on GitHub
Applied AI experiments and examples for PyTorch
☆322Aug 22, 2025Updated 10 months ago
albanD / subclass_zoo
View on GitHub
☆192Jun 16, 2024Updated 2 years ago
bitsandbytes-foundation / bitsandbytes
View on GitHub
Accessible large language models via k-bit quantization for PyTorch.
☆8,333Updated this week
alibaba / easydist
View on GitHub
Automated Parallelization System and Infrastructure for Multiple Ecosystems
☆82Nov 19, 2024Updated last year