socialfoundations / benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
☆15 Updated 10 months ago
Alternatives and similar repositories for benchbench
Users who are interested in benchbench are comparing it to the libraries listed below.
- Recycling diverse models ☆44 Updated 2 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781) ☆13 Updated 2 years ago
- ☆18 Updated 2 years ago
- ☆45 Updated 2 years ago
- Post-processing for fair classification ☆14 Updated last month
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 Updated 2 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction ☆36 Updated 3 years ago
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift ☆34 Updated last year
- ☆35 Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms ☆56 Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆16 Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆97 Updated 2 years ago
- ☆16 Updated last year
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated last year
- Active and Sample-Efficient Model Evaluation ☆24 Updated 2 weeks ago
- ☆25 Updated 5 years ago
- Fine-grained ImageNet annotations ☆29 Updated 5 years ago
- ☆34 Updated last week
- ☆19 Updated 3 years ago
- Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020) ☆28 Updated 4 years ago
- Improving Transformation Invariance in Contrastive Representation Learning ☆13 Updated 4 years ago
- ☆38 Updated 3 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning" ☆17 Updated last year
- SGD with large step sizes learns sparse features [ICML 2023] ☆32 Updated 2 years ago
- Wrap around any model to output differentially private prediction sets with finite sample validity on any dataset. ☆17 Updated last year
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021) ☆16 Updated 2 years ago
- LISA for ICML 2022 ☆49 Updated 2 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks. ☆42 Updated last year
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability" ☆36 Updated 2 years ago
- ☆24 Updated 4 years ago