snuspl / parallax
A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
☆130Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for parallax
- Lightweight and Parallel Deep Learning Framework☆263Updated last year
- Python bindings for NVTX☆66Updated last year
- ☆25Updated 5 years ago
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆70Updated 3 years ago
- Simple Distributed Deep Learning on TensorFlow☆134Updated 2 years ago
- this is the release repository of superneurons☆52Updated 3 years ago
- Repository for SysML19 Artifacts Evaluation☆53Updated 5 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆247Updated last year
- ☆82Updated 2 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- Research and development for optimizing transformers☆124Updated 3 years ago
- Runtime Tracing Library for TensorFlow☆42Updated 5 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 7 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆97Updated 2 years ago
- ☆24Updated last year
- ☆47Updated last year
- Training neural networks in TensorFlow 2.0 with 5x less memory☆128Updated 2 years ago
- Fine-grained GPU sharing primitives☆140Updated 4 years ago
- An analytical performance modeling tool for deep neural networks.☆87Updated 4 years ago
- System for automated integration of deep learning backends.☆48Updated 2 years ago
- Place for meetup slides☆140Updated 4 years ago
- Model-less Inference Serving☆82Updated last year
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆124Updated 2 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆194Updated 2 years ago
- ☆21Updated last year
- Study Group of Deep Learning Compiler☆152Updated last year
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last year
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆63Updated 6 years ago
- ☆20Updated 5 years ago