stefan-grafberger / mlinspectLinks
Inspect ML Pipelines in Python in the form of a DAG
☆70Updated last year
Alternatives and similar repositories for mlinspect
Users that are interested in mlinspect are comparing it to the libraries listed below
Sorting:
- Jenga is an experimentation library that allows data science practititioners and researchers to study the effect of common data corruptio…☆41Updated 2 years ago
- ☆29Updated 3 years ago
- Flow with FlorDB 🌻☆154Updated 2 months ago
- Editing machine learning models to reflect human knowledge and values☆127Updated last year
- ☆22Updated last year
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated 2 years ago
- Distribution transparent Machine Learning experiments on Apache Spark☆91Updated last year
- this repo might get accepted☆28Updated 4 years ago
- Data-Centric What-If Analysis for Native Machine Learning Pipelines☆16Updated 2 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆75Updated 2 years ago
- openclean - Data Cleaning and data profiling library for Python☆80Updated 3 years ago
- Public home of pycorels, the python binding to CORELS☆80Updated 5 years ago
- A library of Reversible Data Transforms☆127Updated last week
- Code repository for our paper "Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift": https://arxiv.org/abs/1810.119…☆105Updated last year
- The official implementation of "The Shapley Value of Classifiers in Ensemble Games" (CIKM 2021).☆220Updated 2 years ago
- Weakly Supervised End-to-End Learning (NeurIPS 2021)☆157Updated 2 years ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆172Updated 2 years ago
- Coarse-grained lineage and tracing for machine learning pipelines.☆470Updated 2 years ago
- SPEAR: Programmatically label and build training data quickly.☆107Updated last year
- A JSON-based schema for storing declarative descriptions of machine learning experiments☆45Updated 8 years ago
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models.☆30Updated 3 years ago
- Unified slicing for all Python data structures.☆35Updated 5 months ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆165Updated 3 weeks ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- Repository for my master thesis on automated string handling☆16Updated 4 years ago
- Picket is a system that safeguards against data corruptions during both training and deployment of machine learning models over tabular d…☆14Updated 4 years ago
- automatic data slicing☆34Updated 3 years ago
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set.☆54Updated 11 months ago
- Extra functionalities for river☆14Updated last year
- More interactive weak supervision with FlyingSquid☆315Updated 4 years ago