PyTorch elastic training
☆729 · Jun 15, 2022 · Updated 3 years ago
Alternatives and similar repositories for elastic
Users who are interested in elastic are comparing it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,403 · Apr 26, 2025 · Updated 10 months ago
- Serve, optimize and scale PyTorch models in production ☆4,362 · Aug 6, 2025 · Updated 7 months ago
- An end-to-end PyTorch framework for image and video classification ☆1,613 · Jun 27, 2024 · Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆8,936 · Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,679 · Dec 1, 2025 · Updated 3 months ago
- A tensor-aware point-to-point communication primitive for machine learning ☆284 · Dec 17, 2025 · Updated 3 months ago
- A high performance and generic framework for distributed DNN training ☆3,716 · Oct 3, 2023 · Updated 2 years ago
- higher is a PyTorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr… ☆1,628 · Mar 25, 2022 · Updated 3 years ago
- A GPipe implementation in PyTorch ☆862 · Jul 25, 2024 · Updated last year
- A GPU performance profiling tool for PyTorch models ☆510 · Jul 13, 2021 · Updated 4 years ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear… ☆5,642 · Mar 13, 2026 · Updated last week
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently. ☆4,752 · Mar 11, 2026 · Updated last week
- Collective communications library with various primitives for multi-machine training. ☆1,405 · Mar 11, 2026 · Updated last week
- PyTorch layer-by-layer model profiler ☆606 · May 23, 2021 · Updated 4 years ago
- Compiler for Neural Network hardware accelerators ☆3,326 · May 11, 2024 · Updated last year
- PyTorch on Kubernetes ☆309 · Dec 1, 2021 · Updated 4 years ago
- Configure Python functions explicitly and safely ☆129 · Nov 18, 2024 · Updated last year
- Implementations of ideas from recent papers ☆391 · Dec 22, 2020 · Updated 5 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,880 · Jan 2, 2026 · Updated 2 months ago
- torch-optimizer -- collection of optimizers for PyTorch ☆3,167 · Mar 22, 2024 · Updated last year
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆683 · Feb 21, 2020 · Updated 6 years ago
- A uniform interface to run deep learning models from multiple frameworks ☆940 · Jan 3, 2024 · Updated 2 years ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,958 · Updated this week
- Kubernetes-native Deep Learning Framework ☆746 · Jan 26, 2024 · Updated 2 years ago
- Bagua Speeds up PyTorch ☆884 · Aug 1, 2024 · Updated last year
- Fast, general, and tested differentiable structured prediction in PyTorch ☆1,124 · Apr 20, 2022 · Updated 3 years ago
- functorch is JAX-like composable function transforms for PyTorch. ☆1,437 · Aug 21, 2025 · Updated 7 months ago
- A modular framework for vision & language multimodal research from Facebook AI Research (FAIR) ☆5,623 · Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,077 · Apr 17, 2024 · Updated last year
- ☆169 · Feb 20, 2021 · Updated 5 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,756 · Dec 18, 2025 · Updated 3 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆421 · Updated this week
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,625 · Nov 17, 2023 · Updated 2 years ago
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ☆30,926 · Mar 10, 2026 · Updated last week
- Model interpretability and understanding for PyTorch ☆5,580 · Mar 11, 2026 · Updated last week
- 🚪✊ Knock Knock: Get notified when your training ends with only two additional lines of code ☆2,825 · Jun 23, 2023 · Updated 2 years ago
- Generate embeddings from large-scale graph-structured data. ☆3,459 · Mar 3, 2024 · Updated 2 years ago
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,697 · Mar 9, 2026 · Updated last week
- Bayesian optimization in PyTorch ☆3,481 · Updated this week