schelterlabs / jenga
Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.
☆39Updated last year
Alternatives and similar repositories for jenga:
Users that are interested in jenga are comparing it to the libraries listed below
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- automatic data slicing☆35Updated 3 years ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆103Updated last year
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning☆46Updated 9 months ago
- ☆32Updated 3 years ago
- Train Gradient Boosting models that are both high-performance *and* Fair!☆103Updated 9 months ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆51Updated 2 years ago
- Editing machine learning models to reflect human knowledge and values☆124Updated last year
- ☆11Updated 3 weeks ago
- SPEAR: Programmatically label and build training data quickly.☆105Updated 9 months ago
- Explaining Inference Queries with Bayesian Optimization☆10Updated 4 years ago
- ☆19Updated 7 months ago
- KEN: Relational Data Embeddings☆28Updated last year
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆156Updated 2 years ago
- A Scalable Auto-ML System☆53Updated 2 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆228Updated this week
- Python Interface of the Scalable Bayesian Rule Lists☆19Updated 5 years ago
- Measuring data importance over ML pipelines using the Shapley value.☆38Updated last month
- openclean - Data Cleaning and data profiling library for Python☆75Updated 3 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆43Updated 3 years ago
- A practical Active Learning python package with a strong focus on experiments.☆51Updated 2 years ago
- Public home of pycorels, the python binding to CORELS☆77Updated 4 years ago
- ☆22Updated last year
- Code for the CIKM 2019 Paper "Fast and Accurate Network Embeddings via Very Sparse Random Projection"☆57Updated 5 years ago
- A Tree Search Library for Data Cleaning☆22Updated 3 years ago
- ☆101Updated 6 months ago
- Model Agnostic Counterfactual Explanations☆87Updated 2 years ago
- The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).☆219Updated last year