LOG-postech / SAM-overparam
Code for reproducing the results from the arXiv paper "Critical Influence of Overparameterization on Sharpness-aware Minimization"
☆14 · Updated 6 months ago
Alternatives and similar repositories for SAM-overparam:
Users who are interested in SAM-overparam are comparing it to the repositories listed below:
- 🔨 Malet (Machine Learning Experiment Tool) is a tool for efficient machine learning experiment execution, logging, analysis, and plot ma… ☆17 · Updated this week
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019) ☆17 · Updated 5 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆35 · Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 · Updated last year
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information" ☆69 · Updated 8 months ago
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication" ☆12 · Updated 2 years ago
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2) ☆39 · Updated 11 months ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆25 · Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆19 · Updated 3 years ago
- Official PyTorch implementation of "Robust Deep Learning from Crowds with Belief Propagation" ☆18 · Updated 2 years ago
- Training vision models with full-batch gradient descent and regularization ☆37 · Updated last year
- Lookahead: A Far-sighted Alternative of Magnitude-based Pruning (ICLR 2020) ☆33 · Updated 4 years ago
- tiny-imagenet dataset downloader & reader using tensorflow_datasets (tfds) api ☆21 · Updated 5 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022) ☆28 · Updated 2 years ago
- PyTorch implementation of neural processes and variants ☆27 · Updated 5 months ago
- Source code of "What can linearized neural networks actually say about generalization?" ☆19 · Updated 3 years ago
- PyTorch implementation of "Large-Scale Meta-Learning with Continual Trajectory Shifting" (ICML 2021) ☆17 · Updated 3 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks. ☆41 · Updated last year