data-centric-ai / dcbench
A benchmark of data-centric tasks from across the machine learning lifecycle.
☆72Updated 2 years ago
Alternatives and similar repositories for dcbench:
Users that are interested in dcbench are comparing it to the libraries listed below
- ☆136Updated last year
- Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wi…☆52Updated 3 years ago
- ☆62Updated 3 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago
- Combating hidden stratification with GEORGE☆63Updated 3 years ago
- This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles e…☆81Updated last week
- Data for "Datamodels: Predicting Predictions with Training Data"☆95Updated last year
- ☆88Updated last year
- ☆95Updated 2 years ago
- ☆27Updated 7 months ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆103Updated 11 months ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆71Updated 2 years ago
- Measuring data importance over ML pipelines using the Shapley value.☆38Updated last month
- 🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet☆182Updated 2 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆40Updated 2 years ago
- Reusable BatchBALD implementation☆78Updated last year
- DISTIL: Deep dIverSified inTeractIve Learning. An active/inter-active learning library built on py-torch for reducing labeling costs.☆148Updated 2 years ago
- ☆105Updated last year
- Advances in Neural Information Processing Systems (NeurIPS 2021)☆22Updated 2 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆31Updated 3 years ago
- Implementation of Estimating Training Data Influence by Tracing Gradient Descent (NeurIPS 2020)☆227Updated 3 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- ☆35Updated last year
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentation☆42Updated 4 years ago
- Code for "Supermasks in Superposition"☆121Updated last year
- A supplementary code for Editable Neural Networks, an ICLR 2020 submission.☆46Updated 5 years ago
- MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts (ICLR 2022)☆109Updated 2 years ago
- ☆54Updated 4 years ago
- NumPy library for calibration metrics☆69Updated 3 weeks ago
- An active learning library for Pytorch based on Lightning-Fabric.☆79Updated 10 months ago