schelterlabs / jenga
Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.
☆38Updated last year
Alternatives and similar repositories for jenga:
Users that are interested in jenga are comparing it to the libraries listed below
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated 11 months ago
- automatic data slicing☆35Updated 3 years ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆102Updated 10 months ago
- A Benchmark for Joint Data Cleaning and Machine Learning☆46Updated 7 months ago
- Editing machine learning models to reflect human knowledge and values☆124Updated last year
- openclean - Data Cleaning and data profiling library for Python☆72Updated 3 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated last year
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆157Updated last year
- ☆21Updated last year
- ☆32Updated 3 years ago
- SPEAR: Programmatically label and build training data quickly.☆104Updated 7 months ago
- Python Interface of the Scalable Bayesian Rule Lists☆19Updated 5 years ago
- this repo might get accepted☆29Updated 4 years ago
- ☆94Updated 5 months ago
- The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).☆218Updated last year
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated last year
- Explaining Inference Queries with Bayesian Optimization☆10Updated 4 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Train Gradient Boosting models that are both high-performance *and* Fair!☆103Updated 7 months ago
- A library of Reversible Data Transforms☆123Updated this week
- A Natural Language Interface to Explainable Boosting Machines☆64Updated 7 months ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆50Updated 2 years ago
- ☆32Updated 3 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- Public home of pycorels, the python binding to CORELS☆77Updated 4 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- ☆29Updated 3 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Updated last year
- Model Agnostic Counterfactual Explanations☆86Updated 2 years ago