mlcommons / dataperf
Data Benchmarking
☆19Updated 9 months ago
Alternatives and similar repositories for dataperf:
Users that are interested in dataperf are comparing it to the libraries listed below
- In-context Example Selection with Influences☆15Updated last year
- Code for the ICLR 2021 Paper "In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness"☆12Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆31Updated 3 years ago
- ☆22Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆41Updated 2 weeks ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆95Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year
- Recycling diverse models☆44Updated 2 years ago
- ☆26Updated 3 weeks ago
- Official Repository for Dataset Inference for LLMs☆32Updated 7 months ago
- Few-shot Learning with Auxiliary Data☆27Updated last year
- ☆17Updated 2 years ago
- A weak supervision framework for (partial) labeling functions☆16Updated 8 months ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆20Updated last week
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆15Updated 2 years ago
- ☆28Updated last year
- Tasks for describing differences between text distributions.☆16Updated 7 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆25Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- ☆15Updated 6 months ago
- Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"☆13Updated 5 months ago
- ☆34Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆18Updated 2 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated 5 months ago
- ☆23Updated 10 months ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago
- ☆87Updated last year
- Foundation Models for Data Tasks☆102Updated last year