keskarnitish/large-batch-training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/keskarnitish/large-batch-training)

keskarnitish / large-batch-training

Code to reproduce some of the figures in the paper "On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima"

☆147

Alternatives and similar repositories for large-batch-training

Users that are interested in large-batch-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenwei202 / smoothout
View on GitHub
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
☆23Nov 21, 2018Updated 7 years ago
eladhoffer / bigBatch
View on GitHub
Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training o…
☆148May 25, 2017Updated 9 years ago
kbullaughey / dni-synthetic-gradients
View on GitHub
Torch implementation reproducing MNIST experiments from DeepMind's DNI paper.
☆44Mar 4, 2017Updated 9 years ago
leiwu0 / sgd.stability
View on GitHub
Analyze the dynamic stability of SGD
☆13Nov 25, 2018Updated 7 years ago
szagoruyko / openai-gemm.pytorch
View on GitHub
PyTorch bindings for openai-gemm
☆20Feb 6, 2017Updated 9 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
CosmosShadow / DNI_Torch
View on GitHub
DNI(Decoupled Neural Interfaces using Synthetic Gradients) implementation with Torch
☆30Aug 30, 2016Updated 9 years ago
loshchil / SGDR
View on GitHub
☆257Nov 23, 2016Updated 9 years ago
Cadene / torchnet-m2caiworkflow
View on GitHub
Finalist entry for the M2CAI Workflow Challenge 2016
☆10Nov 25, 2016Updated 9 years ago
Avmb / lowrank-highwaynetwork
View on GitHub
Low-rank Highway Networks
☆13Mar 11, 2016Updated 10 years ago
goldblum / TruthOrBackpropaganda
View on GitHub
An empirical investigation of deep learning theory
☆16Oct 3, 2019Updated 6 years ago
carpedm20 / RCMN
View on GitHub
Recurrent Convolutional Memory Network (in progress)
☆29Apr 16, 2016Updated 10 years ago
nutszebra / shake_shake
View on GitHub
Implementation of Shake-Shake by chainer (Shake-Shake regularization of 3-branch residual networks: https://openreview.net/forum?id=HkO-P…
☆10Aug 24, 2017Updated 8 years ago
jn2clark / nn-iterated-projections
View on GitHub
Neural network training using iterated projections.
☆89Jan 17, 2017Updated 9 years ago
daodaofr / caffe-re-id
View on GitHub
☆12Oct 8, 2016Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
o19s / solr-movielens-recommender
View on GitHub
Movielens collaborative filtering with Solr streaming expression
☆10Oct 13, 2016Updated 9 years ago
jayanthkoushik / sgd-feedback
View on GitHub
☆69Dec 19, 2018Updated 7 years ago
jhkim89 / PyramidNet
View on GitHub
Torch implementation of the paper "Deep Pyramidal Residual Networks" (https://arxiv.org/abs/1610.02915).
☆131Oct 31, 2017Updated 8 years ago
jnhwkim / ddx
View on GitHub
Deep Learning Dashboard
☆39Sep 4, 2016Updated 9 years ago
masabdi / multi-resnet
View on GitHub
Multi-Residual Networks
☆23Nov 25, 2016Updated 9 years ago
ryankiros / layer-norm
View on GitHub
Code and models from the paper "Layer Normalization"
☆243Nov 8, 2016Updated 9 years ago
phseo / PAN
View on GitHub
Progressive Attention Networks
☆12Oct 25, 2016Updated 9 years ago
ganeshjawahar / tweet-classify
View on GitHub
Tweet Classification using RNN and CNN
☆43Sep 18, 2016Updated 9 years ago
Este1le / hpo_nmt
View on GitHub
Datasets for Hyperparameter Optimization of Neural Machine Translation
☆10Aug 19, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
uuujf / SGDNoise
View on GitHub
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆15Apr 12, 2020Updated 6 years ago
davidBelanger / SPEN
View on GitHub
Structured Prediction Energy Networks in Torch
☆132Feb 8, 2017Updated 9 years ago
tomgoldstein / loss-landscape
View on GitHub
Code for visualizing the loss landscape of neural nets
☆3,194Apr 5, 2022Updated 4 years ago
jiamings / fast-weights
View on GitHub
Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)
☆173Nov 3, 2016Updated 9 years ago
ryoungj / optdom
View on GitHub
[ICLR'22] Self-supervised learning optimally robust representations for domain shift.
☆25Feb 2, 2022Updated 4 years ago
erogol / resnet.torch
View on GitHub
an updated version of fb.resnet.torch with many changes.
☆38Dec 16, 2016Updated 9 years ago
dhwajraj / spark-text-tagger
View on GitHub
Script to perform dictionary based n-gram text tagging efficiently in apache spark
☆10Sep 30, 2016Updated 9 years ago
benanne / nervana_theano
View on GitHub
A rudimentary wrapper around the fast Maxwell kernels for GEMM and convolution operations provided by nervanagpu
☆34May 7, 2015Updated 11 years ago
willwhitney / understanding-visual-concepts
View on GitHub
Unsupervised learning of visual concepts from video
☆56May 5, 2016Updated 10 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
davidstutz / robust-generalization-flatness
View on GitHub
Implementation of average- and worst-case robust flatness measures for adversarial training.
☆15Nov 5, 2021Updated 4 years ago
szagoruyko / wide-residual-networks
View on GitHub
3.8% and 18.3% on CIFAR-10 and CIFAR-100
☆1,314Aug 20, 2019Updated 6 years ago
gaosh / Structured-Bayesian-Pruning-pytorch
View on GitHub
pytorch implementation of Structured Bayesian Pruning
☆19Jul 13, 2018Updated 8 years ago
fmassa / optimize-net
View on GitHub
OptNet - Reducing memory usage in torch neural nets
☆282Apr 19, 2017Updated 9 years ago
BingzheWu / pytorch_crowd_count
View on GitHub
☆17Aug 22, 2017Updated 8 years ago
yaolubrain / DOSNES
View on GitHub
Doubly Stochastic Neighbor Embedding on Spheres
☆60Sep 13, 2019Updated 6 years ago
snf / keras-fractalnet
View on GitHub
FractalNet implementation in Keras: Ultra-Deep Neural Networks without Residuals
☆156Sep 17, 2017Updated 8 years ago