easeml / datascope
Measuring data importance over ML pipelines using the Shapley value.
☆42Updated 3 months ago
Alternatives and similar repositories for datascope
Users who are interested in datascope are comparing it to the libraries listed below
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆41Updated 2 years ago
- A benchmark of data-centric tasks from across the machine learning lifecycle.☆72Updated 2 years ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆31Updated 3 years ago
- ☆35Updated last year
- Official Repo for "Efficient task-specific data valuation for nearest neighbor algorithms"☆26Updated 5 years ago
- ☆32Updated 3 years ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated 3 months ago
- ☆22Updated last year
- Distributional Shapley: A Distributional Framework for Data Valuation☆30Updated last year
- automatic data slicing☆34Updated 3 years ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆104Updated last year
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆66Updated 2 years ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆96Updated 3 months ago
- Active and Sample-Efficient Model Evaluation☆24Updated 4 years ago
- ☆89Updated last week
- A lightweight implementation of removal-based explanations for ML models.☆59Updated 3 years ago
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆118Updated 2 months ago
- ☆17Updated 4 years ago
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆27Updated 3 years ago
- Conformal prediction for controlling monotonic risk functions. Simple accompanying PyTorch code for conformal risk control in computer vi…☆66Updated 2 years ago
- 😇A curated list of links and resources for Fair ML and Data Ethics☆18Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year
- Influence Estimation for Gradient-Boosted Decision Trees☆27Updated 11 months ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio…☆39Updated last year
- Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"☆28Updated 4 years ago
- Training and evaluating NBM and SPAM for interpretable machine learning.☆78Updated 2 years ago
- SPEAR: Programmatically label and build training data quickly.☆106Updated 10 months ago
- Logic Explained Networks is a python repository implementing explainable-by-design deep learning models.☆49Updated last year
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models"☆47Updated last year