ucla-labx / distbeliefLinks
Implementing Google's DistBelief paper
☆114Updated 2 years ago
Alternatives and similar repositories for distbelief
Users that are interested in distbelief are comparing it to the libraries listed below
Sorting:
- implement distributed machine learning with Pytorch + OpenMPI☆52Updated 6 years ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.☆102Updated 7 years ago
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition☆263Updated 5 years ago
- Example codes appears in lectures☆23Updated 3 years ago
- papers on scalable and efficient machine learning systems☆192Updated 7 years ago
- Distributed Learning by Pair-Wise Averaging☆52Updated 8 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆132Updated 3 years ago
- Proximal Asynchronous SAGA☆13Updated 7 years ago
- Efficient Architecture Search by Network Transformation, in AAAI 2018☆168Updated 6 years ago
- Papers and blogs related to distributed deep learning☆96Updated 8 years ago
- An analytical performance modeling tool for deep neural networks.☆91Updated 5 years ago
- Code repository for the paper "Hyperparameter Optimization: A Spectral Approach" by Elad Hazan, Adam Klivans, Yang Yuan.☆174Updated 6 years ago
- Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"☆145Updated 8 years ago
- ☆77Updated 6 years ago
- Deep learning system course☆213Updated 6 years ago
- Simple Training and Deployment of Fast End-to-End Binary Networks☆158Updated 3 years ago
- Minimal Tensorflow implementation of the paper "Neural Architecture Search With Reinforcement Learning" presented at ICLR 2017☆40Updated 7 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 9 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 5 years ago
- (Spring 2017) Assignment 2: GPU Executor☆63Updated 8 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆44Updated 8 years ago
- Use TensorFlow efficiently☆96Updated 4 years ago
- Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.☆155Updated 7 years ago
- ☆42Updated 6 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 7 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 7 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Updated 7 years ago
- Implements pytorch code for the Accelerated SGD algorithm.☆215Updated 7 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149Updated 8 years ago
- (Spring 2018) Assignment 2: Graph Executor with TVM☆124Updated 7 years ago