schelterlabs / jenga
Jenga is an experimentation library that lets data science practitioners and researchers study the effect of common data corruptions (e.g., missing values, broken character encodings) on the prediction quality of their ML models.
☆39 · Updated last year
Alternatives and similar repositories for jenga:
Users who are interested in jenga are comparing it to the libraries listed below:
- Inspect ML Pipelines in Python in the form of a DAG ☆70 · Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning ☆46 · Updated 9 months ago
- ☆21 · Updated last year
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119… ☆103 · Updated 11 months ago
- automatic data slicing ☆35 · Updated 3 years ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines ☆16 · Updated last year
- openclean - Data Cleaning and data profiling library for Python ☆74 · Updated 3 years ago
- Editing machine learning models to reflect human knowledge and values ☆124 · Updated last year
- Foundation Models for Data Tasks ☆102 · Updated last year
- A Natural Language Interface to Explainable Boosting Machines ☆65 · Updated 8 months ago
- Measuring data importance over ML pipelines using the Shapley value. ☆38 · Updated last month
- Explaining Inference Queries with Bayesian Optimization ☆10 · Updated 4 years ago
- ☆32 · Updated 3 years ago
- A benchmark of data-centric tasks from across the machine learning lifecycle. ☆72 · Updated 2 years ago
- ☆76 · Updated 5 months ago
- A practical Active Learning python package with a strong focus on experiments. ☆51 · Updated 2 years ago
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆43 · Updated 3 years ago
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set. ☆51 · Updated 6 months ago
- SPEAR: Programmatically label and build training data quickly. ☆105 · Updated 8 months ago
- ☆37 · Updated 3 years ago
- Repository for "Online Active Model Selection for Pre-trained ML Classifiers" ☆15 · Updated 2 years ago
- Model Agnostic Counterfactual Explanations ☆87 · Updated 2 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021) ☆156 · Updated 2 years ago
- Distribution transparent Machine Learning experiments on Apache Spark ☆90 · Updated last year
- Python Interface of the Scalable Bayesian Rule Lists ☆19 · Updated 5 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆41 · Updated 2 weeks ago
- ☆19 · Updated 6 months ago
- Code for the CIKM 2019 Paper "Fast and Accurate Network Embeddings via Very Sparse Random Projection" ☆57 · Updated 5 years ago
- ☆29 · Updated 3 years ago