microsoft / Delayed-Compensation-Asynchronous-Stochastic-Gradient-Descent-for-Multiverso
Asynchronous Stochastic Gradient Descent with Delay Compensation
☆21Updated 7 years ago
Alternatives and similar repositories for Delayed-Compensation-Asynchronous-Stochastic-Gradient-Descent-for-Multiverso:
Users that are interested in Delayed-Compensation-Asynchronous-Stochastic-Gradient-Descent-for-Multiverso are comparing it to the libraries listed below
- ☆12Updated 6 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- implement distributed machine learning with Pytorch + OpenMPI☆51Updated 6 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 8 years ago
- image to column☆30Updated 10 years ago
- FRED simulator and associated paper☆26Updated 9 years ago
- A platform for distributed optimization expriments using OpenMPI☆21Updated 7 years ago
- Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…☆13Updated 9 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 2 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 4 years ago
- Proximal Asynchronous SAGA☆12Updated 7 years ago
- ☆42Updated 5 years ago
- RDMA Optimization on MXNet☆14Updated 7 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Updated 6 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- An analytical performance modeling tool for deep neural networks.☆88Updated 4 years ago
- A library for syntactically rewriting Python programs, pronounced (sinner).☆70Updated 3 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 4 years ago
- A lightweight parameter server interface☆76Updated 2 years ago
- DMALab's reading group slides and papers.☆17Updated 3 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆125Updated 7 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆33Updated 8 years ago
- LIBBLE by Parameter Server☆17Updated 6 years ago
- Implementing Google's DistBelief paper☆109Updated 2 years ago
- ☆12Updated 4 years ago
- DLPack for Tensorflow☆36Updated 4 years ago
- (Spring 2017) Assignment 2: GPU Executor☆62Updated 7 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆28Updated 5 years ago
- ☆24Updated 7 years ago