michaelfarrell76 / Distributed-SGD
Parallel SGD, run locally and remotely
☆14Updated 9 years ago
Alternatives and similar repositories for Distributed-SGD
Users who are interested in Distributed-SGD are comparing it to the libraries listed below.
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning☆23Updated 7 years ago
- Proceedings of ICML 2018☆39Updated last week
- ☆42Updated 6 years ago
- ☆77Updated 6 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 7 years ago
- ☆38Updated 7 years ago
- Implementation of the Deep Frank-Wolfe Algorithm -- Pytorch☆62Updated 4 years ago
- ☆19Updated 7 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Updated 7 years ago
- TensorFlow implementation of (Momentum) Stochastic Variance-Adapted Gradient.☆44Updated 7 years ago
- numpy implementation of net 2 net from the paper Net2Net: Accelerating Learning via Knowledge Transfer http://arxiv.org/abs/1511.05641☆53Updated 9 years ago
- meProp: Sparsified Back Propagation for Accelerated Deep Learning (ICML 2017)☆110Updated 3 years ago
- Implementation of Adversarial Variational Optimization in PyTorch☆43Updated 7 years ago
- Tensorflow and Numpy Implementation of Net2Net (http://arxiv.org/abs/1511.05641)☆48Updated 7 years ago
- Implementation of paper "GibbsNet: Iterative Adversarial Inference for Deep Graphical Models" in PyTorch☆57Updated 8 years ago
- ☆34Updated 7 years ago
- ☆13Updated 7 years ago
- ☆79Updated 8 years ago
- A Keras inspired training utility for PyTorch☆38Updated 7 years ago
- Proceedings of ICML 2017☆24Updated 3 years ago
- ☆46Updated 7 years ago
- Convexified Convolutional Neural Networks☆15Updated 9 years ago
- PyTorch Impl. of Prediction Optimizer (to stabilize GAN training)☆31Updated 7 years ago
- Compare outputs between layers written in TensorFlow and layers written in PyTorch☆72Updated 7 years ago
- Automatically build the deep learning models with ENAS☆31Updated 7 years ago
- Wide-residual network implementations. Best result for cifar10(97.12%), cifar100(84.12%), and other kaggle challenges☆37Updated 9 years ago
- Various experiments on the [Fashion-MNIST](https://github.com/zalandoresearch/fashion-mnist) dataset from Zalando