kudkudak / dnn_sharpest_directions

Code for "On the Relation Between the Sharpest Directions of DNN Loss and the SGD Step Length", ICLR 2019

☆11

Alternatives and similar repositories for dnn_sharpest_directions

Users who are interested in dnn_sharpest_directions are comparing it to the repositories listed below.

wenwei202 / smoothout
SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning
☆23 Updated 6 years ago
eladhoffer / fix_your_classifier
☆34 Updated 6 years ago
oval-group / dfw
Implementation of the Deep Frank-Wolfe Algorithm in PyTorch
☆62 Updated 4 years ago
insperatum / vhe
The Variational Homoencoder: Learning to learn high capacity generative models from few examples
☆34 Updated 2 years ago
renmengye / meta-optim-public
Understanding Short-Horizon Bias in Stochastic Meta-Optimization
☆37 Updated 7 years ago
bneyshabur / over-parametrization
Computing various norms/measures on over-parametrized neural networks
☆49 Updated 6 years ago
Healbadbad / curveball-pytorch
An implementation of "Small steps and giant leaps: Minimal Newton solvers for Deep Learning" in PyTorch
☆21 Updated 7 years ago
uclaml / Padam
Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …
☆39 Updated 2 years ago
erogol / Net2Net
Net2Net implementation on PyTorch for any possible vision layers.
☆38 Updated 7 years ago
lballes / msvag
TensorFlow implementation of (Momentum) Stochastic Variance-Adapted Gradient.
☆44 Updated 7 years ago
ravidziv / Information-bottleneck
Python implementation of the information bottleneck method (Tishby et al., 1999)
☆36 Updated 8 years ago
eleniTriantafillou / few_shot_mAP_public
This repository contains the code for the paper "Few-Shot Learning Through an Information Retrieval Lens". Eleni Triantafillou, Richard Z…
☆25 Updated 7 years ago
vsyrgkanis / optimistic_GAN_training
☆46 Updated 7 years ago
wlwkgus / GibbsNet
Implementation of paper "GibbsNet: Iterative Adversarial Inference for Deep Graphical Models" in PyTorch
☆57 Updated 7 years ago
ucla-vision / information-dropout
Implementation of Information Dropout
☆39 Updated 8 years ago
locuslab / lml
The Limited Multi-Label Projection Layer
☆59 Updated 11 months ago
ramprs / neuron-importance-zsl
[ECCV 2018] code for Choose Your Neuron: Incorporating Domain Knowledge Through Neuron Importance
☆57 Updated 6 years ago
stevenygd / SWALP
Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training".
☆62 Updated 6 years ago
hongyanz / multibranch
Code for the paper "Deep Neural Networks with Multi-Branch Architectures Are Less Non-Convex"
☆20 Updated 4 years ago
BayesWatch / pytorch-blockswap
Code for BlockSwap (ICLR 2020).
☆33 Updated 4 years ago
Zeta36 / random-memory-adaptation
Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"
☆24 Updated 7 years ago
huangleiBuaa / OthogonalWN
This project is the Torch implementation of our accepted AAAI 2018 paper : orthogonal weight normalization method for solving orthogonali…
☆57 Updated 5 years ago
moskomule / shampoo.pytorch
An implementation of shampoo
☆75 Updated 7 years ago
demelin / Noise-Contrastive-Estimation-NCE-for-pyTorch
Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…
☆44 Updated 5 years ago
eladhoffer / norm_matters
☆23 Updated 6 years ago
michaelfarrell76 / Distributed-SGD
Parallel SGD, done locally and remotely
☆14 Updated 9 years ago
awentzonline / pytorch-cns
Generalized Compressed Network Search with PyTorch
☆26 Updated 7 years ago
zhangyuc / CCNN
Convexified Convolutional Neural Networks
☆15 Updated 8 years ago
BorealisAI / bre-gan
Code for ICLR2018 paper: Improving GAN Training via Binarized Representation Entropy (BRE) Regularization - Y. Cao · W Ding · Y.C. Lui · …
☆20 Updated 7 years ago
mingzhang-yin / ARM-gradient
Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019)
☆28 Updated 6 years ago