Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"
☆146Apr 24, 2017Updated 8 years ago
Alternatives and similar repositories for large-batch-training
Users that are interested in large-batch-training are comparing it to the libraries listed below
Sorting:
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning☆23Nov 21, 2018Updated 7 years ago
- Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…☆149May 25, 2017Updated 8 years ago
- Torch implementation reproducing MNIST experiments from DeepMind's DNI paper.☆44Mar 4, 2017Updated 9 years ago
- DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch☆30Aug 30, 2016Updated 9 years ago
- PyTorch bindings for openai-gemm☆20Feb 6, 2017Updated 9 years ago
- ☆255Nov 23, 2016Updated 9 years ago
- Low-rank Highway Networks☆13Mar 11, 2016Updated 9 years ago
- Neural network training using iterated projections.☆90Jan 17, 2017Updated 9 years ago
- Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).☆130Oct 31, 2017Updated 8 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Oct 13, 2016Updated 9 years ago
- Recurrent Convolutional Memory Network (in progress)☆29Apr 16, 2016Updated 9 years ago
- Code and models from the paper "Layer Normalization"☆243Nov 8, 2016Updated 9 years ago
- Implementation of Shake-Shake by chainer (Shake-Shake regularization of 3-branch residual networks: https://openreview.net/forum?id=HkO-P…☆10Aug 24, 2017Updated 8 years ago
- ☆12Oct 8, 2016Updated 9 years ago
- Script to perform dictionary based n-gram text tagging efficiently in apache spark☆11Sep 30, 2016Updated 9 years ago
- ☆69Dec 19, 2018Updated 7 years ago
- Deep Learning Dashboard☆38Sep 4, 2016Updated 9 years ago
- Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)☆174Nov 3, 2016Updated 9 years ago
- Multi-Residual Networks☆23Nov 25, 2016Updated 9 years ago
- Doubly Stochastic Neighbor Embedding on Spheres☆60Sep 13, 2019Updated 6 years ago
- An empirical investigation of deep learning theory☆16Oct 3, 2019Updated 6 years ago
- Structured Prediction Energy Networks in Torch☆132Feb 8, 2017Updated 9 years ago
- argparse extension for hpman☆17Dec 4, 2022Updated 3 years ago
- a much more complex case using GradNorm, where the layer sharing situation is sophisticated.☆15Feb 21, 2019Updated 7 years ago
- Self-Supervised Domain Adaptation with Consistency Training☆19Oct 28, 2020Updated 5 years ago
- Unsupervised learning of visual concepts from video☆56May 5, 2016Updated 9 years ago
- Tweet Classification using RNN and CNN☆43Sep 18, 2016Updated 9 years ago
- Implementation of "Domain-adaptive deep network compression", ICCV 2017☆28Jul 12, 2018Updated 7 years ago
- Lasagne code for weight normalization☆88Apr 3, 2016Updated 9 years ago
- See the wrold with ResNet☆14May 1, 2017Updated 8 years ago
- ☆16Jun 18, 2025Updated 8 months ago
- AutoML framework balancing performance and complexity☆15May 9, 2021Updated 4 years ago
- A more memory efficient Torch implementation of "Densely Connected Convolutional Networks".☆29May 11, 2017Updated 8 years ago
- 3.8% and 18.3% on CIFAR-10 and CIFAR-100☆1,310Aug 20, 2019Updated 6 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw…☆365Nov 22, 2018Updated 7 years ago
- ☆221Feb 15, 2020Updated 6 years ago
- Unofficial Pytorch implementation of the paper Filter Response Normalization.☆19Dec 9, 2019Updated 6 years ago
- Machine Learning - A Friendly Handbook (Open Notes)☆19Feb 26, 2017Updated 9 years ago
- The project is about predicting sets (of classes) from images.☆23Aug 31, 2021Updated 4 years ago