socialfoundations / benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
☆15Updated 8 months ago
Alternatives and similar repositories for benchbench:
Users that are interested in benchbench are comparing it to the libraries listed below
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- ☆34Updated last week
- ☆44Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆96Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year
- ☆35Updated last year
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- Distilling Model Failures as Directions in Latent Space☆46Updated 2 years ago
- LISA for ICML 2022☆47Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- ☆17Updated 2 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆42Updated last year
- ☆24Updated 4 years ago
- SGD with large step sizes learns sparse features [ICML 2023]☆32Updated last year
- Personal implementation of ASIF by Antonio Norelli☆25Updated 10 months ago
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆12Updated 3 years ago
- Post-processing for fair classification☆13Updated 2 months ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆23Updated 3 years ago
- ☆38Updated 3 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆40Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆29Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆29Updated 2 years ago
- ☆28Updated 8 months ago
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability"☆35Updated 2 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆52Updated last year
- ☆18Updated 8 months ago
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆25Updated last year
- ☆107Updated last year
- ☆19Updated 2 years ago