pytorch / elastic
PyTorch elastic training
☆730Updated 2 years ago
Alternatives and similar repositories for elastic:
Users that are interested in elastic are comparing it to the libraries listed below
- A GPipe implementation in PyTorch☆828Updated 6 months ago
- A GPU performance profiling tool for PyTorch models☆501Updated 3 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆685Updated 4 years ago
- Mesh TensorFlow: Model Parallelism Made Easier☆1,604Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆345Updated this week
- Fast Block Sparse Matrices for Pytorch☆546Updated 4 years ago
- A uniform interface to run deep learning models from multiple frameworks☆936Updated last year
- A performant and modular runtime for TensorFlow☆759Updated last week
- TVM integration into PyTorch☆452Updated 5 years ago
- PyTorch layer-by-layer model profiler☆606Updated 3 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆253Updated 2 years ago
- High performance model preprocessing library on PyTorch☆651Updated 10 months ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆767Updated this week
- Bagua Speeds up PyTorch☆879Updated 6 months ago
- Slicing a PyTorch Tensor Into Parallel Shards☆298Updated 3 years ago
- Collective communications library with various primitives for multi-machine training.☆1,263Updated last week
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,035Updated last year
- Implementations of ideas from recent papers☆391Updated 4 years ago
- Easily benchmark machine learning models in PyTorch☆149Updated 10 months ago
- For recording and retrieving metadata associated with ML developer and data scientist workflows.☆637Updated 3 months ago
- common in-memory tensor structure☆942Updated last week
- ☆387Updated 2 years ago
- A multi-model machine learning feature embedding database☆636Updated 5 years ago
- Python library to easily log experiments and parallelize hyperparameter search for neural networks☆735Updated 2 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,029Updated 10 months ago
- Pytorch Lightning Distributed Accelerators using Ray☆211Updated last year
- Model analysis tools for TensorFlow☆1,261Updated this week
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition☆260Updated 4 years ago
- Implementation of https://arxiv.org/abs/1904.00962☆371Updated 4 years ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,817Updated last year