pytorch / torchxLinks

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

☆378

Alternatives and similar repositories for torchx

Users that are interested in torchx are comparing it to the libraries listed below

Sorting:

pytorch / torchsnapshot
A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…
☆158Updated last month
facebookresearch / HolisticTraceAnalysis
A library to analyze PyTorch traces.
☆400Updated this week
ray-project / ray_lightning
Pytorch Lightning Distributed Accelerators using Ray
☆213Updated last year
pytorch / rfcs
PyTorch RFCs (experimental)
☆133Updated 2 months ago
pytorch / tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
☆259Updated 2 years ago
pytorch / torcharrow
High performance model preprocessing library on PyTorch
☆649Updated last year
pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆180Updated 3 weeks ago
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,056Updated last year
pytorch / torchft
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
☆372Updated this week
pytorch / test-infra
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …
☆96Updated this week
pytorch / kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆842Updated last week
pytorch / ort
Accelerate PyTorch models with ONNX Runtime
☆364Updated 5 months ago
pytorch / PiPPy
Pipeline Parallelism for PyTorch
☆775Updated 11 months ago
gpuopenanalytics / pynvml
Provide Python access to the NVML library for GPU diagnostics
☆242Updated 8 months ago
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆210Updated 3 months ago
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆157Updated 2 weeks ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆482Updated 2 weeks ago
microsoft / varuna
☆251Updated last year
google / saxml
☆142Updated 2 weeks ago
google / paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…
☆522Updated this week
coreweave / tensorizer
Module, Model, and Tensor Serialization/Deserialization
☆250Updated this week
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆273Updated 3 years ago
octoml / octoml-profile
Home for OctoML PyTorch Profiler
☆113Updated 2 years ago
google / praxis
☆187Updated this week
pytorch / torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…
☆236Updated 6 months ago
triton-inference-server / onnxruntime_backend
The Triton backend for the ONNX Runtime.
☆156Updated 2 weeks ago
google / aqt
☆323Updated last month
NVIDIA / PyProf
A GPU performance profiling tool for PyTorch models
☆503Updated 4 years ago
pytorch / torchdistx
Torch Distributed Experimental
☆117Updated last year
NVIDIA / nvidia-resiliency-ext
NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …
☆196Updated this week