easeml / datascope
Measuring data importance over ML pipelines using the Shapley value.
☆36Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for datascope
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆40Updated 2 years ago
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms"☆26Updated 4 years ago
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆25Updated 2 years ago
- ☆32Updated 3 years ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆87Updated 3 months ago
- ☆17Updated 4 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆31Updated 3 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆30Updated 3 years ago
- ☆22Updated last year
- ☆35Updated last year
- Distributional Shapley: A Distributional Framework for Data Valuation☆30Updated 6 months ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆102Updated 7 months ago
- ☆21Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆90Updated last year
- CHOP: An optimization library based on PyTorch, with applications to adversarial examples and structured neural network training.☆76Updated 8 months ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆53Updated last year
- A benchmark of data-centric tasks from across the machine learning lifecycle.☆72Updated 2 years ago
- Training and evaluating NBM and SPAM for interpretable machine learning.☆76Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆38Updated 7 months ago
- ☆86Updated last year
- Influence Estimation for Gradient-Boosted Decision Trees☆25Updated 5 months ago
- A simple PyTorch implementation of influence functions.☆79Updated 5 months ago
- Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"☆27Updated 3 years ago
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆57Updated last year
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆35Updated 2 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"☆31Updated 3 years ago
- SPEAR: Programmatically label and build training data quickly.☆103Updated 4 months ago
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆109Updated this week
- Advances in Neural Information Processing Systems (NeurIPS 2021)☆22Updated 2 years ago
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.☆11Updated 5 months ago