tml-epfl / understanding-sam
Towards Understanding Sharpness-Aware Minimization [ICML 2022]
☆35Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for understanding-sam
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ☆33Updated 9 months ago
- Source code of "What can linearized neural networks actually say about generalization?☆18Updated 3 years ago
- ☆55Updated 4 years ago
- Training vision models with full-batch gradient descent and regularization☆38Updated last year
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)☆27Updated last year
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆40Updated last year
- ☆37Updated 3 years ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations☆30Updated 4 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)☆28Updated 3 years ago
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)☆39Updated 10 months ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆67Updated 6 months ago
- Code for the paper "Understanding Generalization through Visualizations"☆60Updated 3 years ago
- [NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features☆53Updated 2 years ago
- ☆61Updated 3 years ago
- ☆23Updated 2 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆25Updated last year
- ☆34Updated 3 years ago
- ☆59Updated 3 years ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆23Updated 2 years ago
- Simple CIFAR10 ResNet example with JAX.☆21Updated 3 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆60Updated 3 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆45Updated 9 months ago
- ☆25Updated last year
- ☆58Updated last year
- ☆35Updated last year
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated last year