easeml / datascopeLinks
Measuring data importance over ML pipelines using the Shapley value.
☆42Updated last month
Alternatives and similar repositories for datascope
Users that are interested in datascope are comparing it to the libraries listed below
Sorting:
- TabDPT: Scaling Tabular Foundation Models☆30Updated 2 months ago
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆41Updated 2 years ago
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated 4 months ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆30Updated 4 years ago
- automatic data slicing☆34Updated 3 years ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆105Updated last year
- ☆35Updated last year
- A benchmark of data-centric tasks from across the machine learning lifecycle.☆72Updated 3 years ago
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆97Updated 4 months ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆57Updated last year
- Conformal prediction for controlling monotonic risk functions. Simple accompanying PyTorch code for conformal risk control in computer vi…☆67Updated 2 years ago
- ☆22Updated last year
- PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuatio…☆27Updated 3 years ago
- A repo for transfer learning with deep tabular models☆104Updated 2 years ago
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models"☆47Updated last year
- Research on Tabular Foundation Models☆52Updated 6 months ago
- A practical Active Learning python package with a strong focus on experiments.☆51Updated 2 years ago
- Distributional Shapley: A Distributional Framework for Data Valuation☆30Updated last year
- XAI-Bench is a library for benchmarking feature attribution explainability techniques☆68Updated 2 years ago
- Revisiting Pretrarining Objectives for Tabular Deep Learning☆63Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- Data-SUITE: Data-centric identification of in-distribution incongruous examples (ICML 2022)☆10Updated 2 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- Recycling diverse models☆44Updated 2 years ago
- Influence Estimation for Gradient-Boosted Decision Trees☆27Updated last year
- Active and Sample-Efficient Model Evaluation☆24Updated last month
- ☆37Updated 3 years ago
- ☆23Updated 2 years ago
- Drift-Resilient TabPFN is a method using In-Context Learning via a Prior-Data Fitted Network, to address temporal distribution shifts in …☆15Updated last month
- ☆138Updated last year