pytorch / elastic
PyTorch elastic training
☆730 Updated 2 years ago
Related projects
Alternatives and complementary repositories for elastic
- A GPU performance profiling tool for PyTorch models ☆495 Updated 3 years ago
- Library for faster pinned CPU <-> GPU transfer in PyTorch ☆683 Updated 4 years ago
- A GPipe implementation in PyTorch ☆818 Updated 3 months ago
- PyTorch on Kubernetes ☆307 Updated 2 years ago
- PyTorch layer-by-layer model profiler ☆608 Updated 3 years ago
- A uniform interface to run deep learning models from multiple frameworks ☆936 Updated 10 months ago
- Fast Block Sparse Matrices for PyTorch ☆545 Updated 3 years ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆332 Updated this week
- A tensor-aware point-to-point communication primitive for machine learning ☆249 Updated last year
- High performance model preprocessing library on PyTorch ☆649 Updated 7 months ago
- Collective communications library with various primitives for multi-machine training. ☆1,227 Updated this week
- TVM integration into PyTorch ☆452 Updated 4 years ago
- Slicing a PyTorch Tensor Into Parallel Shards ☆296 Updated 3 years ago
- A performant and modular runtime for TensorFlow ☆756 Updated last month
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,027 Updated last year
- ☆378 Updated 2 years ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,591 Updated last year
- Bagua Speeds up PyTorch ☆876 Updated 3 months ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,210 Updated this week
- common in-memory tensor structure ☆912 Updated last month
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors. ☆253 Updated 2 years ago
- Implementations of ideas from recent papers ☆391 Updated 3 years ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as … ☆193 Updated 2 years ago
- Python SDK for building, training, and deploying ML models ☆337 Updated 2 years ago
- A multi-model machine learning feature embedding database ☆633 Updated 4 years ago
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition ☆262 Updated 4 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,801 Updated 11 months ago
- Lightweight and Parallel Deep Learning Framework ☆263 Updated last year
- Profiling and inspecting memory in PyTorch ☆1,020 Updated 3 months ago
- PyTorchPipe (PTP) is a component-oriented framework for rapid prototyping and training of computational pipelines combining vision and la… ☆225 Updated 5 years ago