mlcommons / dataperf
Data Benchmarking
☆23 Updated last year
Alternatives and similar repositories for dataperf
Users who are interested in dataperf compare it to the libraries listed below.
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark ☆226 Updated last year
- ☆31 Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆97 Updated 2 years ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy ☆83 Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆222 Updated last year
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration" ☆41 Updated last year
- ☆103 Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024) ☆111 Updated last year
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch. ☆54 Updated 2 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models. ☆72 Updated 2 years ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici… ☆106 Updated 2 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆43 Updated 2 years ago
- Documenting large text datasets 🖼️ 📚 ☆14 Updated 11 months ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua… ☆36 Updated 2 years ago
- Official Python client library for the OpenReview API ☆214 Updated last week
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf ☆25 Updated 2 years ago
- Code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs" ☆137 Updated last year
- ☆29 Updated 9 months ago
- Official Repository for Dataset Inference for LLMs ☆43 Updated last year
- ☆27 Updated last year
- A benchmark of data-centric tasks from across the machine learning lifecycle. ☆72 Updated 3 years ago
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM) ☆92 Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆58 Updated 2 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 Updated 2 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆47 Updated 2 years ago
- PAIR.withgoogle.com and friends' work on interpretability methods ☆215 Updated 2 weeks ago
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024) ☆50 Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆99 Updated 4 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆57 Updated 2 years ago