ucla-labx / distbeliefLinks
Implementing Google's DistBelief paper
☆110Updated 2 years ago
Alternatives and similar repositories for distbelief
Users that are interested in distbelief are comparing it to the libraries listed below
Sorting:
- implement distributed machine learning with Pytorch + OpenMPI☆51Updated 6 years ago
- An analytical performance modeling tool for deep neural networks.☆89Updated 4 years ago
- GPU-specialized parameter server for GPU machine learning.☆101Updated 7 years ago
- Distributed Learning by Pair-Wise Averaging☆52Updated 7 years ago
- Deep learning system course☆215Updated 6 years ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.☆102Updated 7 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 5 years ago
- Papers and blogs related to distributed deep learning☆96Updated 7 years ago
- Example codes appears in lectures☆23Updated 3 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 8 years ago
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition☆263Updated 4 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 6 years ago
- Code repository for the paper "Hyperparameter Optimization: A Spectral Approach" by Elad Hazan, Adam Klivans, Yang Yuan.☆173Updated 6 years ago
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"☆139Updated 8 years ago
- CS294; AI For Systems and Systems For AI☆224Updated 5 years ago
- [Deprecated] The TensorFlow Profiler (TFProf) UI provides a visual interface for profiling TensorFlow models.☆136Updated 5 years ago
- ☆372Updated 7 years ago
- DLPack for Tensorflow☆35Updated 5 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆130Updated 3 years ago
- Use TensorFlow efficiently☆95Updated 4 years ago
- Plot TensorBoard graphs fast☆51Updated 3 years ago
- Tutorial code on how to build your own Deep Learning System in 2k Lines☆125Updated 8 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Updated 6 years ago
- Efficient Architecture Search by Network Transformation, in AAAI 2018☆169Updated 6 years ago
- Asynchronous Stochastic Gradient Descent with Delay Compensation☆21Updated 7 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 7 years ago
- Code for the neural architecture search methods contained in the paper Efficient Forward Neural Architecture Search☆110Updated last year
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- Simple Training and Deployment of Fast End-to-End Binary Networks☆157Updated 3 years ago