chu-data-lab / CPClean
Data Cleaning for ML under the Certain Prediction Framework
☆11 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for CPClean
- Inspect ML Pipelines in Python in the form of a DAG ☆68 · Updated 8 months ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio… ☆35 · Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning ☆44 · Updated 4 months ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119… ☆102 · Updated 7 months ago
- ☆20 · Updated last year
- DeepEverest: a system for efficient DNN interpretation. ☆13 · Updated 9 months ago
- (ICML 2021) Mandoline: Model Evaluation under Distribution Shift ☆31 · Updated 3 years ago
- Model Agnostic Counterfactual Explanations ☆87 · Updated 2 years ago
- Extra functionalities for river ☆14 · Updated 5 months ago
- Measuring data importance over ML pipelines using the Shapley value. ☆36 · Updated last week
- Supervised Local Modeling for Interpretability ☆28 · Updated 6 years ago
- A collection of implementations of fair ML algorithms ☆12 · Updated 6 years ago
- SPEAR: Programmatically label and build training data quickly. ☆103 · Updated 4 months ago
- An automated machine learning tool aimed to facilitate AutoML research. ☆95 · Updated 2 months ago
- Python Interface of the Scalable Bayesian Rule Lists ☆19 · Updated 4 years ago
- Explaining Inference Queries with Bayesian Optimization ☆10 · Updated 3 years ago
- A practical Active Learning python package with a strong focus on experiments. ☆51 · Updated 2 years ago
- CAIPI turns LIMEs into trust! ☆12 · Updated 4 years ago
- ☆32 · Updated 3 years ago
- Public home of pycorels, the python binding to CORELS ☆75 · Updated 4 years ago
- Pipeline Explorer - Explore and analyze millions of pipelines learned using MLBlocks and MLPrimitives. ☆17 · Updated last year
- Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning (AISTATS 2022 Oral) ☆38 · Updated 2 years ago
- Editing machine learning models to reflect human knowledge and values ☆123 · Updated last year
- A collection of data sets for stream learning. ☆32 · Updated 4 years ago
- Repository for "Online Active Model Selection for Pre-trained ML Classifiers" ☆15 · Updated last year
- A Tree Search Library for Data Cleaning ☆21 · Updated 2 years ago
- Repository for code release of paper "Robust Variational Autoencoders for Outlier Detection and Repair of Mixed-Type Data" (AISTATS 2020) ☆50 · Updated 4 years ago
- Distribution transparent Machine Learning experiments on Apache Spark ☆90 · Updated 8 months ago
- A Python Package providing two algorithms, DAME and FLAME, for fast and interpretable treatment-control matches of categorical data ☆57 · Updated 5 months ago
- CEML - Counterfactuals for Explaining Machine Learning models - A Python toolbox ☆42 · Updated 3 months ago