tml-epfl / sam-low-rank-features
Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]
☆27 Updated last year
Alternatives and similar repositories for sam-low-rank-features:
Users who are interested in sam-low-rank-features are comparing it to the libraries listed below.
- ☆35 Updated 2 years ago
- ☆57 Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- ☆34 Updated last year
- ☆11 Updated 2 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆35 Updated 2 years ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆55 Updated last year
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022) ☆28 Updated 2 years ago
- Implementation of Beyond Neural Scaling beating power laws for deep models and prototype-based models ☆33 Updated 3 months ago
- Codebase used in the paper "Foundational Models for Continual Learning: An Empirical Study of Latent Replay". ☆30 Updated 2 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization" ☆20 Updated 6 months ago
- The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from t… ☆78 Updated 2 years ago
- This repository is the official implementation of Dataset Condensation with Contrastive Signals (DCC), accepted at ICML 2022. ☆20 Updated 2 years ago
- ☆22 Updated 2 years ago
- SparCL: Sparse Continual Learning on the Edge @ NeurIPS 22 ☆29 Updated last year
- ☆14 Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆97 Updated last year
- Code for the paper "A Light Recipe to Train Robust Vision Transformers" [SaTML 2023] ☆53 Updated 2 years ago
- ☆107 Updated last year
- ☆84 Updated 2 years ago
- ☆28 Updated 8 months ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation" ☆37 Updated 2 years ago
- Training vision models with full-batch gradient descent and regularization ☆37 Updated 2 years ago
- Test-Time Adaptation via Conjugate Pseudo-Labels ☆39 Updated last year
- ☆18 Updated last year
- ☆42 Updated 4 months ago
- Host CIFAR-10.2 Data Set ☆13 Updated 3 years ago
- [CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale" ☆18 Updated 10 months ago
- Recycling diverse models ☆44 Updated 2 years ago
- What do we learn from inverting CLIP models? ☆52 Updated last year