pytorch / elastic
PyTorch elastic training
☆730 Updated 2 years ago
Related projects
Alternatives and complementary repositories for elastic
- A GPU performance profiling tool for PyTorch models ☆493 Updated 3 years ago
- A GPipe implementation in PyTorch ☆814 Updated 3 months ago
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆683 Updated 4 years ago
- PyTorch layer-by-layer model profiler ☆608 Updated 3 years ago
- PyTorch on Kubernetes ☆306 Updated 2 years ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆332 Updated 2 weeks ago
- Fast Block Sparse Matrices for PyTorch ☆545 Updated 3 years ago
- High performance model preprocessing library on PyTorch ☆648 Updated 7 months ago
- A uniform interface to run deep learning models from multiple frameworks ☆936 Updated 10 months ago
- TVM integration into PyTorch ☆453 Updated 4 years ago
- common in-memory tensor structure ☆905 Updated last month
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters. ☆727 Updated this week
- Slicing a PyTorch Tensor Into Parallel Shards ☆296 Updated 3 years ago
- Profiling and inspecting memory in PyTorch ☆1,018 Updated 3 months ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,591 Updated 11 months ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,010 Updated 6 months ago
- PyTorch Lightning Distributed Accelerators using Ray ☆211 Updated last year
- PyTorch, TensorFlow, JAX and NumPy — all of them natively using the same code ☆696 Updated last year
- ☆376 Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learning ☆247 Updated last year
- A performant and modular runtime for TensorFlow ☆756 Updated 3 weeks ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors. ☆253 Updated last year
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,025 Updated last year
- functorch is JAX-like composable function transforms for PyTorch. ☆1,394 Updated this week
- Implementations of ideas from recent papers ☆391 Updated 3 years ago
- Collective communications library with various primitives for multi-machine training. ☆1,219 Updated this week
- A multi-model machine learning feature embedding database ☆631 Updated 4 years ago
- Guide for building custom op for TensorFlow ☆378 Updated last year
- Lightweight and Parallel Deep Learning Framework ☆263 Updated last year
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. ☆1,131 Updated this week