easeml / datascope
Measuring data importance over ML pipelines using the Shapley value.
☆37 · Updated 2 months ago
Alternatives and similar repositories for datascope:
Users who are interested in datascope compare it to the libraries listed below.
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆40 · Updated 2 years ago
- A benchmark of data-centric tasks from across the machine learning lifecycle. ☆72 · Updated 2 years ago
- ☆32 · Updated 3 years ago
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms" ☆26 · Updated 4 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift ☆31 · Updated 3 years ago
- Influence Estimation for Gradient-Boosted Decision Trees ☆26 · Updated 7 months ago
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio… ☆25 · Updated 2 years ago
- Active and Sample-Efficient Model Evaluation ☆24 · Updated 3 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆54 · Updated last year
- ☆17 · Updated 4 years ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023) ☆18 · Updated last year
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119… ☆102 · Updated 9 months ago
- ☆28 · Updated last year
- Domain Adaptation ☆23 · Updated 3 years ago
- XAI-Bench is a library for benchmarking feature attribution explainability techniques ☆60 · Updated last year
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling ☆30 · Updated 3 years ago
- Testing Language Models for Memorization of Tabular Datasets. ☆33 · Updated this week
- Minimal, standalone library for solving GLMs in PyTorch ☆24 · Updated 3 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms" ☆15 · Updated 3 years ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆94 · Updated last year
- Distributional Shapley: A Distributional Framework for Data Valuation ☆30 · Updated 8 months ago
- Advances in Neural Information Processing Systems (NeurIPS 2021) ☆22 · Updated 2 years ago
- Training and evaluating NBM and SPAM for interpretable machine learning. ☆75 · Updated last year
- ☆35 · Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆38 · Updated 8 months ago
- Conformal prediction for controlling monotonic risk functions. Simple accompanying PyTorch code for conformal risk control in computer vi… ☆60 · Updated last year
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment. ☆11 · Updated 7 months ago
- 😇 A curated list of links and resources for Fair ML and Data Ethics ☆18 · Updated 2 years ago
- TabDPT: Scaling Tabular Foundation Models ☆21 · Updated this week
- Code/figures in Right for the Right Reasons ☆55 · Updated 4 years ago