DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
Alternatives and similar repositories for diwa
Users that are interested in diwa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆31May 1, 2025Updated 10 months ago
- Recycling diverse models☆46Jan 18, 2023Updated 3 years ago
- ☆18Jun 20, 2024Updated last year
- ☆12Jul 17, 2023Updated 2 years ago
- LISA for ICML 2022☆52Apr 12, 2023Updated 2 years ago
- ☆44Oct 30, 2025Updated 4 months ago
- ☆33Jul 8, 2024Updated last year
- Invariant-feature Subspace Recovery (ISR)☆23Sep 23, 2022Updated 3 years ago
- Official Implementation of SWAD (NeurIPS 2021)☆170Dec 10, 2022Updated 3 years ago
- ☆45Nov 4, 2020Updated 5 years ago
- ☆18Oct 29, 2021Updated 4 years ago
- ☆12Sep 29, 2019Updated 6 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆26Apr 11, 2023Updated 2 years ago
- Model Stock: All we need is just a few fine-tuned models☆129Aug 9, 2025Updated 7 months ago
- ☆48Jan 17, 2023Updated 3 years ago
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Nov 27, 2023Updated 2 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16May 4, 2023Updated 2 years ago
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆27Feb 24, 2025Updated last year
- Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time☆510Jul 15, 2024Updated last year
- Learning Representations that Support Robust Transfer of Predictors☆20Nov 7, 2021Updated 4 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆112Jun 8, 2023Updated 2 years ago
- [ICML'24] Open-Vocabulary Calibration for Fine-tuned CLIP☆18Jun 14, 2024Updated last year
- ☆110Sep 20, 2023Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- MDL Complexity computations and experiments from the paper "Revisiting complexity and the bias-variance tradeoff".☆18Jun 12, 2023Updated 2 years ago
- ☆38Jul 13, 2022Updated 3 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆43Oct 24, 2023Updated 2 years ago
- Code for paper "Adversarial Support Alignment"☆23Apr 22, 2022Updated 3 years ago
- ☆73Jun 27, 2023Updated 2 years ago
- Source code for the Nature Machine Intelligence paper: When and how convolutional neural networks generalize to out-of-distribution categ…☆24Feb 26, 2022Updated 4 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".☆15Feb 15, 2023Updated 3 years ago
- ICML 2020, Estimating Generalization under Distribution Shifts via Domain-Invariant Representations☆23Jun 30, 2020Updated 5 years ago
- Toy datasets to evaluate algorithms for domain generalization and invariance learning.☆43Dec 5, 2021Updated 4 years ago
- This is an implementation of the DeepView framework that was presented in the paper Schulz, A., Hinder, F., & Hammer, B. (2020): https://…☆21Nov 27, 2025Updated 3 months ago
- ACCV2022 Source Code of paper "Feature Decoupled Knowledge Distillation via Spatial Pyramid Pooling"☆12Jul 5, 2023Updated 2 years ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago
- Predicting Out-of-Distribution Error with the Projection Norm☆19Jul 27, 2022Updated 3 years ago