ucla-labx / distbelief
Implementing Google's DistBelief paper
☆109Updated 2 years ago
Alternatives and similar repositories for distbelief:
Users that are interested in distbelief are comparing it to the libraries listed below
- implement distributed machine learning with Pytorch + OpenMPI☆51Updated 6 years ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.☆102Updated 7 years ago
- Example codes appears in lectures☆23Updated 3 years ago
- GPU-specialized parameter server for GPU machine learning.☆101Updated 7 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 4 years ago
- An analytical performance modeling tool for deep neural networks.☆88Updated 4 years ago
- Deep learning system course☆217Updated 6 years ago
- DLPack for Tensorflow☆35Updated 5 years ago
- Distributed Learning by Pair-Wise Averaging☆52Updated 7 years ago
- image to column☆30Updated 10 years ago
- Codebase associated with the PyTorch compiler tutorial☆45Updated 5 years ago
- Analyze TensorFlow source code☆19Updated 8 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 7 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆130Updated 3 years ago
- ☆12Updated 6 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆43Updated 8 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 6 years ago
- Move to https://github.com/apache/incubator-tvm-site☆26Updated 4 years ago
- Asynchronous Stochastic Gradient Descent with Delay Compensation☆21Updated 7 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 7 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Updated 6 years ago
- CS294; AI For Systems and Systems For AI☆224Updated 5 years ago
- Papers and blogs related to distributed deep learning☆96Updated 7 years ago
- Implementation of Parameter Server using PyTorch communication lib☆43Updated 6 years ago
- ☆53Updated 7 years ago
- FRED simulator and associated paper☆26Updated 9 years ago
- Proximal Asynchronous SAGA☆12Updated 7 years ago
- papers on scalable and efficient machine learning systems☆192Updated 6 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆125Updated 8 years ago
- Efficient Architecture Search by Network Transformation, in AAAI 2018☆169Updated 5 years ago