mpezeshki / Gradient_Starvation
Gradient Starvation: A Learning Proclivity in Neural Networks
☆61 Updated 4 years ago
Alternatives and similar repositories for Gradient_Starvation
Users that are interested in Gradient_Starvation are comparing it to the libraries listed below
- Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization ☆86 Updated 3 years ago
- ☆55 Updated 4 years ago
- ☆34 Updated 4 years ago
- ☆34 Updated 3 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard … ☆39 Updated 4 years ago
- ☆35 Updated last year
- ☆58 Updated 3 years ago
- Code for "Supermasks in Superposition" ☆124 Updated last year
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift. ☆24 Updated 3 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks. ☆42 Updated last year
- Computing various measures and generalization bounds on convolutional and fully connected networks ☆35 Updated 6 years ago
- Official implementation of the paper "Gradient Matching for Domain Generalization" ☆122 Updated 3 years ago
- Sinkhorn Label Allocation is a label assignment method for semi-supervised self-training algorithms. The SLA algorithm is described in fu… ☆53 Updated 3 years ago
- Official Implementation of Remembering for the Right Reasons (ICLR 2021) ☆30 Updated 3 years ago
- ☆45 Updated 2 years ago
- ☆19 Updated 3 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information" ☆72 Updated last year
- ☆34 Updated this week
- Code for the paper "Understanding Generalization through Visualizations" ☆60 Updated 4 years ago
- ☆107 Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆35 Updated 2 years ago
- IIRC: Incremental Implicitly Refined Classification ☆30 Updated 2 years ago
- Implementation of Invariant Risk Minimization https://arxiv.org/abs/1907.02893 ☆88 Updated 5 years ago
- ☆38 Updated 7 months ago
- Rethinking Bias-Variance Trade-off for Generalization of Neural Networks ☆49 Updated 4 years ago
- An Investigation of Why Overparameterization Exacerbates Spurious Correlations ☆31 Updated 4 years ago
- Code to reproduce experiments from "Does Knowledge Distillation Really Work?", a paper that appeared in the NeurIPS 2021 proceedings. ☆33 Updated last year
- ☆38 Updated 3 years ago
- A way to achieve uniform confidence far away from the training data. ☆38 Updated 4 years ago
- Last-layer Laplace approximation code examples ☆82 Updated 3 years ago