mpezeshki / Gradient_Starvation
Gradient Starvation: A Learning Proclivity in Neural Networks
☆61Updated 4 years ago
Alternatives and similar repositories for Gradient_Starvation:
Users that are interested in Gradient_Starvation are comparing it to the libraries listed below
- ☆34Updated 3 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆41Updated last year
- ☆34Updated 3 years ago
- Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization☆85Updated 2 years ago
- Official implementation of paper Gradient Matching for Domain Generalization☆117Updated 3 years ago
- ☆55Updated 4 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆68Updated 8 months ago
- Code for "Supermasks in Superposition"☆121Updated last year
- Computing various measures and generalization bounds on convolutional and fully connected networks☆35Updated 6 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893☆85Updated 4 years ago
- ☆34Updated 5 months ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆23Updated 2 years ago
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)☆39Updated 11 months ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆60Updated 4 years ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations☆30Updated 4 years ago
- ☆62Updated 3 years ago
- Improving Transformation Invariance in Contrastive Representation Learning☆13Updated 3 years ago
- Last-layer Laplace approximation code examples☆82Updated 3 years ago
- ☆58Updated 3 years ago
- Sinkhorn Label Allocation is a label assignment method for semi-supervised self-training algorithms. The SLA algorithm is described in fu…☆53Updated 3 years ago
- Rethinking Bias-Variance Trade-off for Generalization of Neural Networks☆49Updated 3 years ago
- Winning Solution of the NeurIPS 2020 Competition on Predicting Generalization in Deep Learning☆38Updated 3 years ago
- ☆35Updated last year
- ☆57Updated last year
- ☆43Updated 2 years ago
- A way to achieve uniform confidence far away from the training data.☆37Updated 3 years ago
- ☆107Updated last year
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago