socialfoundations / benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
☆15Updated last year
Alternatives and similar repositories for benchbench
Users who are interested in benchbench are comparing it to the libraries listed below.
- ☆37Updated last year
- ☆45Updated 2 years ago
- ☆34Updated last month
- Recycling diverse models☆45Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Updated last year
- ☆109Updated 2 years ago
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆36Updated 3 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles e…☆83Updated last month
- Latest Weight Averaging (NeurIPS HITY 2022)☆31Updated 2 years ago
- Code for paper "Can contrastive learning avoid shortcut solutions?" NeurIPS 2021.☆47Updated 3 years ago
- Wrap around any model to output differentially private prediction sets with finite sample validity on any dataset.☆18Updated last year
- ☆95Updated 2 years ago
- ☆18Updated 2 years ago
- ☆19Updated 3 years ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆42Updated 4 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆71Updated 2 years ago
- ☆37Updated 3 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learning☆107Updated 2 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- PyTorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆41Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- LISA for ICML 2022☆50Updated 2 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- ☆107Updated last year
- ☆58Updated 2 years ago
- Error consistency: a black-box analysis for comparing errors between decision makers (NeurIPS 2020)☆9Updated 2 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆42Updated last year