mlcommons / dataperfLinks
Data Benchmarking
☆22Updated last year
Alternatives and similar repositories for dataperf
Users that are interested in dataperf are comparing it to the libraries listed below
Sorting:
- Foundation Models for Data Tasks☆108Updated 2 years ago
- Official Python client library for the OpenReview API☆203Updated this week
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark☆224Updated last year
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆19Updated last month
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- ☆30Updated 2 years ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆111Updated 2 years ago
- ☆100Updated last year
- A fast, effective data attribution method for neural networks in PyTorch☆217Updated 10 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Updated 2 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆78Updated 2 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆72Updated 2 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆53Updated 2 years ago
- Official Repository for Dataset Inference for LLMs☆41Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- ☆54Updated 2 years ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆85Updated this week
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆23Updated 6 months ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆108Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Updated 2 years ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆82Updated last year
- A benchmark of data-centric tasks from across the machine learning lifecycle.☆72Updated 3 years ago
- ☆28Updated 6 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆31Updated 11 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆47Updated last year
- ☆26Updated last year
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)☆87Updated last year
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆25Updated 3 years ago