schelterlabs / jenga
Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.
☆40 Updated last year
Alternatives and similar repositories for jenga:
Users who are interested in jenga are comparing it to the libraries listed below.
- Inspect ML Pipelines in Python in the form of a DAG ☆70 Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning ☆47 Updated 10 months ago
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119… ☆104 Updated last year
- Data Cleaning for ML under the Certain Prediction Framework ☆11 Updated 3 years ago
- automatic data slicing ☆34 Updated 3 years ago
- openclean - Data Cleaning and data profiling library for Python ☆76 Updated 3 years ago
- ☆22 Updated last year
- Spark implementation of computing Shapley Values using monte-carlo approximation ☆74 Updated 2 years ago
- ☆32 Updated 3 years ago
- ☆37 Updated 3 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆43 Updated 3 years ago
- Editing machine learning models to reflect human knowledge and values ☆124 Updated last year
- Model Agnostic Counterfactual Explanations ☆87 Updated 2 years ago
- A library of Reversible Data Transforms ☆124 Updated 2 weeks ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning ☆51 Updated 2 years ago
- Measuring data importance over ML pipelines using the Shapley value. ☆38 Updated 2 months ago
- ☆11 Updated this week
- SPEAR: Programmatically label and build training data quickly. ☆106 Updated 9 months ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆156 Updated 2 years ago
- A benchmark of data-centric tasks from across the machine learning lifecycle. ☆72 Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- A Natural Language Interface to Explainable Boosting Machines ☆66 Updated 9 months ago
- ☆62 Updated 5 months ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models. ☆42 Updated last year
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set. ☆52 Updated 7 months ago
- repository for R library "sbrlmod" ☆25 Updated 11 months ago
- Public home of pycorels, the python binding to CORELS ☆78 Updated 4 years ago
- Characterization of relational table embeddings (VLDB 2024). ☆28 Updated 9 months ago
- Train Gradient Boosting models that are both high-performance *and* Fair! ☆104 Updated 10 months ago
- The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021). ☆220 Updated last year