chu-data-lab / CPClean
Data Cleaning for ML under the Certain Prediction Framework
☆11Updated 3 years ago
Alternatives and similar repositories for CPClean:
Users that are interested in CPClean are comparing it to the libraries listed below
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- ☆21Updated last year
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆103Updated 11 months ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift☆31Updated 3 years ago
- Measuring data importance over ML pipelines using the Shapley value.☆38Updated last month
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral)☆40Updated 2 years ago
- DeepEverest: a system for efficient DNN interpretation.☆12Updated last year
- A Natural Language Interface to Explainable Boosting Machines☆65Updated 8 months ago
- A benchmark of data-centric tasks from across the machine learning lifecycle.☆72Updated 2 years ago
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆39Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning☆46Updated 9 months ago
- Model Agnostic Counterfactual Explanations☆87Updated 2 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆41Updated 2 weeks ago
- Distributional Shapley: A Distributional Framework for Data Valuation☆30Updated 10 months ago
- Editing machine learning models to reflect human knowledge and values☆124Updated last year
- Testing Language Models for Memorization of Tabular Datasets.☆33Updated last month
- 💱 A curated list of data valuation (DV) to design your next data marketplace☆117Updated last month
- Data-SUITE: Data-centric identification of in-distribution incongruous examples (ICML 2022)☆10Updated 2 years ago
- Neural Additive Models (Google Research)☆26Updated 10 months ago
- Beta calibration☆29Updated last year
- Influence Estimation for Gradient-Boosted Decision Trees☆26Updated 9 months ago
- ☆20Updated 5 years ago
- CEML - Counterfactuals for Explaining Machine Learning models - A Python toolbox☆43Updated 7 months ago
- ☆17Updated 4 years ago
- Supervised Local Modeling for Interpretability☆28Updated 6 years ago
- Code/figures in Right for the Right Reasons☆55Updated 4 years ago
- SPEAR: Programmatically label and build training data quickly.☆105Updated 8 months ago
- automatic data slicing☆35Updated 3 years ago
- Hyperparameter tuning via uncertainty modeling☆47Updated 10 months ago
- TabDPT: Scaling Tabular Foundation Models☆26Updated last week