meta-pytorch/torchx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/meta-pytorch/torchx)

meta-pytorch / torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

☆427

Alternatives and similar repositories for torchx

Users that are interested in torchx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pytorch / torchdistx
View on GitHub
Torch Distributed Experimental
☆117Aug 5, 2024Updated last year
meta-pytorch / torchsnapshot
View on GitHub
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆165Jun 10, 2026Updated last month
pytorch / torcharrow
View on GitHub
High performance model preprocessing library on PyTorch
☆641Mar 29, 2024Updated 2 years ago
pytorch / PiPPy
View on GitHub
Pipeline Parallelism for PyTorch
☆786Aug 21, 2024Updated last year
meta-pytorch / torchft
View on GitHub
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆526Jul 16, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pytorch / functorch
View on GitHub
functorch is JAX-like composable function transforms for PyTorch.
☆1,434Aug 21, 2025Updated 11 months ago
meta-pytorch / data
View on GitHub
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
☆1,258Updated this week
meta-pytorch / multipy
View on GitHub
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆179Dec 16, 2025Updated 7 months ago
pytorch / kineto
View on GitHub
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆975Updated this week
meta-pytorch / monarch
View on GitHub
PyTorch Single Controller
☆1,060Updated this week
pytorch / tensorpipe
View on GitHub
A tensor-aware point-to-point communication primitive for machine learning
☆286Dec 17, 2025Updated 7 months ago
NVIDIA / nvidia-resiliency-ext
View on GitHub
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …
☆311Updated this week
pytorch / torchdynamo
View on GitHub
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,078Apr 17, 2024Updated 2 years ago
meta-pytorch / autoparallel
View on GitHub
An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.
☆89Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ray-project / distml
View on GitHub
Distributed ML Optimizer
☆35Jul 28, 2021Updated 4 years ago
meta-pytorch / torchstore
View on GitHub
A storage solution for PyTorch tensors with distributed tensor support.
☆81Jul 17, 2026Updated last week
pytorch / ort
View on GitHub
Accelerate PyTorch models with ONNX Runtime
☆369Feb 5, 2026Updated 5 months ago
pytorch / elastic
View on GitHub
PyTorch elastic training
☆727Jun 15, 2022Updated 4 years ago
meta-pytorch / torcheval
View on GitHub
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…
☆248May 15, 2026Updated 2 months ago
facebookresearch / fairscale
View on GitHub
PyTorch extensions for high performance and large scale training.
☆3,411Apr 26, 2025Updated last year
meta-pytorch / torchcomms
View on GitHub
torchcomms: a modern PyTorch communications API
☆380Updated this week
meta-pytorch / torchforge
View on GitHub
PyTorch-native post-training at scale
☆696Updated this week
meta-pytorch / torchrec
View on GitHub
Pytorch domain library for recommendation systems
☆2,588Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
pytorch / benchmark
View on GitHub
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆1,043Updated this week
pytorch / ao
View on GitHub
PyTorch native quantization and sparsity for training and inference
☆2,910Updated this week
facebookresearch / HolisticTraceAnalysis
View on GitHub
A library to analyze PyTorch traces.
☆535May 29, 2026Updated last month
pytorch / rfcs
View on GitHub
PyTorch RFCs (experimental)
☆148Jul 14, 2026Updated last week
alpa-projects / alpa
View on GitHub
Training and serving large-scale neural networks with auto parallelization.
☆3,180Dec 9, 2023Updated 2 years ago
meta-pytorch / tnt
View on GitHub
A lightweight library for PyTorch training tools and utilities
☆1,721Jul 16, 2026Updated last week
pytorch / serve
View on GitHub
Serve, optimize and scale PyTorch models in production
☆4,350Aug 6, 2025Updated 11 months ago
pytorch / torchtitan
View on GitHub
A PyTorch native platform for training generative AI models
☆5,554Updated this week
pytorch / tensordict
View on GitHub
TensorDict is a pytorch dedicated tensor container.
☆1,034Updated this week
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ray-project / ray_lightning
View on GitHub
Pytorch Lightning Distributed Accelerators using Ray
☆215Nov 3, 2023Updated 2 years ago
pytorch / gloo
View on GitHub
Collective communications library with various primitives for multi-machine training.
☆1,438Jul 1, 2026Updated 3 weeks ago
NVIDIA / PyProf
View on GitHub
A GPU performance profiling tool for PyTorch models
☆510Jul 13, 2021Updated 5 years ago
facebookresearch / fairring
View on GitHub
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆66Mar 21, 2022Updated 4 years ago
BaguaSys / bagua
View on GitHub
Bagua Speeds up PyTorch
☆881Aug 1, 2024Updated last year
mosaicml / streaming
View on GitHub
A Data Streaming Library for Efficient Neural Network Training
☆1,535Jun 25, 2026Updated 3 weeks ago
microsoft / varuna
View on GitHub
☆251Jul 25, 2024Updated last year