socialfoundations / benchbenchLinks
BenchBench is a Python package to evaluate multi-task benchmarks.
☆16Updated last year
Alternatives and similar repositories for benchbench
Users that are interested in benchbench are comparing it to the libraries listed below
Sorting:
- ☆37Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Updated 2 years ago
- ☆34Updated 3 months ago
- ☆46Updated 2 years ago
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆35Updated 2 years ago
- Recycling diverse models☆45Updated 2 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- ☆108Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- ☆18Updated 2 years ago
- Distilling Model Failures as Directions in Latent Space☆47Updated 2 years ago
- ☆19Updated 3 years ago
- ☆95Updated 2 years ago
- This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles e…☆83Updated 2 months ago
- ☆30Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆41Updated 2 years ago
- ☆108Updated last year
- Code accompanying paper: Meta-Learning to Improve Pre-Training☆37Updated 3 years ago
- Fast Axiomatic Attribution for Neural Networks (NeurIPS*2021)☆16Updated 2 years ago
- ☆37Updated 3 years ago
- Personal implementation of ASIF by Antonio Norelli☆25Updated last year
- ☆55Updated 5 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆31Updated 2 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learning☆107Updated 2 years ago
- Natural Language Descriptions of Deep Visual Features, ICLR 2022☆65Updated 2 years ago
- Wrap around any model to output differentially private prediction sets with finite sample validity on any dataset.☆18Updated last year
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆42Updated 4 years ago
- ☆36Updated 2 years ago