HoloClean / holocleanLinks
A Machine Learning System for Data Enrichment.
☆525Updated 2 years ago
Alternatives and similar repositories for holoclean
Users that are interested in holoclean are comparing it to the libraries listed below
Sorting:
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f…☆508Updated 6 months ago
- ☆193Updated last year
- An open source, high scalability toolkit in Java for Entity Resolution.☆221Updated 3 months ago
- ☆79Updated 2 years ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆249Updated last month
- ☆96Updated 5 years ago
- More interactive weak supervision with FlyingSquid☆315Updated 5 years ago
- Python package for performing Entity and Text Matching using Deep Learning.☆605Updated last year
- python automatic data quality check toolkit☆282Updated 5 years ago
- A collection of tutorials for Snorkel☆404Updated 10 months ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 3 months ago
- A list of free data matching and record linkage software.☆392Updated last year
- detect demographic differences in the output of machine learning models or other assessments☆317Updated 5 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- Flow with FlorDB 🌻☆153Updated 2 weeks ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆76Updated 2 years ago
- Type System for Data Analysis in Python☆213Updated 8 months ago
- SparkER: an Entity Resolution framework for Apache Spark☆65Updated last year
- Human-explainable AI.☆525Updated last month
- Inspect ML Pipelines in Python in the form of a DAG☆70Updated last year
- The complete graph data science platform☆139Updated 8 months ago
- DeltaPy - Tabular Data Augmentation (by @firmai)☆554Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆125Updated last year
- Implementation of statistical models to analyze time lagged conversions☆263Updated last year
- A model-agnostic visual debugging tool for machine learning☆1,669Updated 8 months ago
- ☆271Updated last year
- A Benchmark for Joint Data Cleaning and Machine Learning☆49Updated last year
- A Machine Learning System for Data Enrichment.☆75Updated 7 years ago
- Library for Semi-Automated Data Science☆344Updated 3 weeks ago
- Code and data for Sato https://arxiv.org/abs/1911.06311.☆115Updated last year