ucla-labx / distbeliefLinks
Implementing Google's DistBelief paper
☆110Updated 2 years ago
Alternatives and similar repositories for distbelief
Users that are interested in distbelief are comparing it to the libraries listed below
Sorting:
- implement distributed machine learning with Pytorch + OpenMPI☆51Updated 6 years ago
- Distributed Learning by Pair-Wise Averaging☆52Updated 7 years ago
- DAWNBench: An End-to-End Deep Learning Benchmark and Competition☆262Updated 4 years ago
- ☆75Updated 6 years ago
- GPU-specialized parameter server for GPU machine learning.☆101Updated 7 years ago
- ☆53Updated 7 years ago
- An analytical performance modeling tool for deep neural networks.☆89Updated 4 years ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.☆102Updated 7 years ago
- Deep learning system course☆215Updated 6 years ago
- This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.☆35Updated last month
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Updated 6 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52Updated 5 years ago
- ☆42Updated 5 years ago
- PyProf2: PyTorch Profiling tool☆82Updated 5 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 6 years ago
- Code for the neural architecture search methods contained in the paper Efficient Forward Neural Architecture Search☆110Updated 2 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS)☆43Updated 8 years ago
- Some tensorflow examples☆19Updated 7 years ago
- A library for syntactically rewriting Python programs, pronounced (sinner).☆69Updated 3 years ago
- Simple Distributed Deep Learning on TensorFlow☆133Updated last week
- Example codes appears in lectures☆23Updated 3 years ago
- Analyze TensorFlow source code☆19Updated 8 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆132Updated 3 years ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆41Updated 6 years ago
- Minimal Tensorflow implementation of the paper "Neural Architecture Search With Reinforcement Learning" presented at ICLR 2017☆41Updated 7 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆70Updated 8 years ago
- Efficient Architecture Search by Network Transformation, in AAAI 2018☆169Updated 6 years ago
- Codebase associated with the PyTorch compiler tutorial☆46Updated 5 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Updated 6 years ago
- Move to https://github.com/apache/incubator-tvm-site☆26Updated 4 years ago