socialfoundations / benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
☆15Updated 9 months ago
Alternatives and similar repositories for benchbench:
Users that are interested in benchbench are comparing it to the libraries listed below
- ☆35Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year
- ☆17Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- Recycling diverse models☆44Updated 2 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago
- ☆45Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated last year
- Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)☆15Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆29Updated 2 years ago
- ☆18Updated 9 months ago
- SGD with large step sizes learns sparse features [ICML 2023]☆32Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆34Updated last year
- ☆34Updated last week
- ☆37Updated 3 years ago
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- ☆38Updated 3 years ago
- ☆19Updated 3 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆42Updated last year
- Post-processing for fair classification☆13Updated last week
- Wrap around any model to output differentially private prediction sets with finite sample validity on any dataset.☆17Updated last year
- A simple Jax implementation of influence functions.☆16Updated last year
- Source code of "What can linearized neural networks actually say about generalization?☆20Updated 3 years ago
- Official code for "In Search of Robust Measures of Generalization" (NeurIPS 2020)☆28Updated 4 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆36Updated 2 years ago
- ☆26Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated last year