data-centric-ai / dcbenchLinks
A benchmark of data-centric tasks from across the machine learning lifecycle.
β71Updated 3 years ago
Alternatives and similar repositories for dcbench
Users that are interested in dcbench are comparing it to the libraries listed below
Sorting:
- β141Updated 2 years ago
- π οΈ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSetβ186Updated last month
- Implementation of Estimating Training Data Influence by Tracing Gradient Descent (NeurIPS 2020)β239Updated 3 years ago
- Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wiβ¦β54Updated 4 years ago
- Model Patching: Closing the Subgroup Performance Gap with Data Augmentationβ42Updated 5 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"β97Updated 2 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labelingβ30Updated 4 years ago
- DISTIL: Deep dIverSified inTeractIve Learning. An active/inter-active learning library built on py-torch for reducing labeling costs.β155Updated 3 years ago
- β96Updated 3 years ago
- Combating hidden stratification with GEORGEβ64Updated 4 years ago
- β111Updated 2 years ago
- Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coβ¦β345Updated 2 years ago
- β212Updated 3 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.β72Updated 3 years ago
- ModelDiff: A Framework for Comparing Learning Algorithmsβ58Updated 2 years ago
- This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles eβ¦β86Updated last month
- Measuring data importance over ML pipelines using the Shapley value.β45Updated 5 months ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119β¦β107Updated last year
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extractionβ36Updated 3 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi β¦β66Updated 5 years ago
- β96Updated 5 years ago
- Official cleanlab repo is at https://github.com/cleanlab/cleanlabβ58Updated 3 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learningβ105Updated 2 years ago
- Active and Sample-Efficient Model Evaluationβ26Updated 8 months ago
- Code for "Supermasks in Superposition"β125Updated 2 years ago
- Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" π§ (ICLR 2019)β129Updated 4 years ago
- Domain Adaptationβ23Updated 4 years ago
- A library to create and manage configuration files, especially for machine learning projects.β79Updated 3 years ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)β43Updated 3 years ago
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratcβ¦β154Updated 3 years ago