JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for LAWA
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆28Updated last year
- ☆17Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- Code for T-MARS data filtering☆35Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- ☆34Updated 9 months ago
- ☆15Updated 2 years ago
- Recycling diverse models☆44Updated last year
- ☆41Updated last year
- ☆34Updated 3 months ago
- Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation☆43Updated last year
- Official code for the paper: "Metadata Archaeology"☆18Updated last year
- ☆26Updated 2 years ago
- ☆25Updated 4 months ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Updated 11 months ago
- PRIME: A Few Primitives Can Boost Robustness to Common Corruptions☆42Updated last year
- ☆58Updated last year
- ☆51Updated 5 months ago
- ☆13Updated 8 months ago
- Training vision models with full-batch gradient descent and regularization☆38Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆20Updated last year
- ☆34Updated 2 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆37Updated last year
- ☆21Updated last year
- Distilling Model Failures as Directions in Latent Space☆45Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆91Updated last year
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- ☆15Updated 4 months ago