fKunstner / noise-sgd-adam-signLinks

☆17

Alternatives and similar repositories for noise-sgd-adam-sign

Users that are interested in noise-sgd-adam-sign are comparing it to the libraries listed below

Sorting:

tml-epfl / understanding-sam
Towards Understanding Sharpness-Aware Minimization [ICML 2022]
☆35Updated 3 years ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43Updated 2 years ago
deep-lab / DeepnetHessian
Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)
☆16Updated 6 years ago
epfml / REQ
☆17Updated last year
mueller-mp / SAM-ON
☆34Updated last year
tml-epfl / sgd-sparse-features
SGD with large step sizes learns sparse features [ICML 2023]
☆33Updated 2 years ago
gortizji / linearized-networks
Source code of "What can linearized neural networks actually say about generalization?
☆20Updated 4 years ago
hushon / JAX-ResNet-CIFAR10
Simple CIFAR10 ResNet example with JAX.
☆23Updated 4 years ago
pnnl / torchntk
☆28Updated 2 years ago
MadryLab / BREEDS-Benchmarks
☆55Updated 5 years ago
izmailovpavel / bnn_covariate_shift
Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"
☆33Updated 3 years ago
locuslab / orthogonal-convolutions
Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness
☆47Updated 4 years ago
bneyshabur / generalization-bounds
Computing various measures and generalization bounds on convolutional and fully connected networks
☆35Updated 6 years ago
JonasGeiping / fullbatchtraining
Training vision models with full-batch gradient descent and regularization
☆39Updated 2 years ago
gibipara92 / learning-explanations-hard-to-vary
Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …
☆41Updated 4 years ago
rahimentezari / PermutationInvariance
☆23Updated 2 years ago
harshays / simplicitybiaspitfalls
The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)
☆41Updated last year
nitarshan / robust-generalization-measures
Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)
☆28Updated 4 years ago
locuslab / edge-of-stability
☆71Updated 10 months ago
AnonymousNIPS2019 / DeepnetHessian
The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size
☆17Updated 6 years ago
MadryLab / failure-directions
Distilling Model Failures as Directions in Latent Space
☆47Updated 2 years ago
ssagawa / overparam_spur_corr
An Investigation of Why Overparameterization Exacerbates Spurious Correlations
☆30Updated 5 years ago
facebookresearch / BalancingGroups
Simple data balancing baselines for worst-group-accuracy benchmarks.
☆42Updated 2 years ago
cemanil / LNets
Lipschitz Neural Networks described in "Sorting Out Lipschitz Function Approximation" (ICML 2019).
☆57Updated 5 years ago
bethgelab / InDomainGeneralizationBenchmark
☆34Updated 4 years ago
tding1 / Neural-Collapse
[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features
☆59Updated 3 years ago
yaodongyu / Rethink-BiasVariance-Tradeoff
Rethinking Bias-Variance Trade-off for Generalization of Neural Networks
☆50Updated 4 years ago
team-approx-bayes / fromp
Contains code for the NeurIPS 2020 paper by Pan et al., "Continual Deep Learning by FunctionalRegularisation of Memorable Past"
☆44Updated 4 years ago
wronnyhuang / gen-viz
Code for the paper "Understanding Generalization through Visualizations"
☆64Updated 4 years ago
dydjw9 / Efficient_SAM
☆58Updated 2 years ago