DMALab / Reading_GroupLinks
DMALab's reading group slides and papers.
☆16Updated 4 years ago
Alternatives and similar repositories for Reading_Group
Users that are interested in Reading_Group are comparing it to the libraries listed below
Sorting:
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Updated 6 years ago
- GRACE - GRAdient ComprEssion for distributed deep learning☆139Updated last year
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆125Updated 8 years ago
- CS294; AI For Systems and Systems For AI☆224Updated 5 years ago
- ☆33Updated 6 years ago
- Implementation of Parameter Server using PyTorch communication lib☆42Updated 6 years ago
- Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.☆297Updated last year
- Accelerating Distributed Machine Learning with Data Sketches☆17Updated 6 years ago
- implement distributed machine learning with Pytorch + OpenMPI☆51Updated 6 years ago
- Stochastic Gradient Push for Distributed Deep Learning☆170Updated 2 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 8 years ago
- Simple Distributed Deep Learning on TensorFlow☆133Updated 2 months ago
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆408Updated 2 months ago
- ☆22Updated 6 years ago
- Asynchronous Stochastic Gradient Descent with Delay Compensation☆21Updated 8 years ago
- ☆392Updated 2 years ago
- Atomo: Communication-efficient Learning via Atomic Sparsification☆27Updated 6 years ago
- ☆588Updated 7 years ago
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727☆147Updated 9 months ago
- ☆21Updated 2 years ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆318Updated 3 weeks ago
- Some tensorflow examples☆19Updated 7 years ago
- 位置敏感哈希LSH应用用计算向量最大内积和☆11Updated 7 years ago
- papers on scalable and efficient machine learning systems☆192Updated 6 years ago
- Large batch training of CTR models based on DeepCTR with CowClip.☆170Updated 2 years ago
- ☆142Updated 2 months ago
- ☆25Updated 6 years ago
- Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial☆264Updated 2 years ago
- ☆371Updated 7 years ago
- QSGD-TF☆21Updated 6 years ago