HoloClean / holoclean
A Machine Learning System for Data Enrichment.
☆525 Updated 2 years ago
Alternatives and similar repositories for holoclean
Users who are interested in holoclean are comparing it to the libraries listed below.
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f… ☆508 Updated 5 months ago
- ☆193 Updated last year
- More interactive weak supervision with FlyingSquid ☆315 Updated 5 years ago
- ☆79 Updated 2 years ago
- python automatic data quality check toolkit ☆282 Updated 5 years ago
- ☆96 Updated 5 years ago
- Flow with FlorDB 🌻 ☆154 Updated this week
- The complete graph data science platform ☆139 Updated 7 months ago
- An open source, high scalability toolkit in Java for Entity Resolution. ☆221 Updated 2 months ago
- Python package for performing Entity and Text Matching using Deep Learning. ☆604 Updated last year
- Spark implementation of computing Shapley Values using monte-carlo approximation ☆76 Updated 2 years ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal… ☆248 Updated 3 weeks ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation… ☆165 Updated 2 months ago
- A collection of tutorials for Snorkel ☆403 Updated 10 months ago
- Type System for Data Analysis in Python ☆213 Updated 7 months ago
- Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning). ☆529 Updated 5 years ago
- DeltaPy - Tabular Data Augmentation (by @firmai) ☆552 Updated 2 years ago
- Python library for building highly effective data science workflows ☆949 Updated 2 years ago
- 🐳 The stupidly simple CLI workspace for your data warehouse. ☆728 Updated 2 years ago
- A list of free data matching and record linkage software. ☆392 Updated last year
- Coarse-grained lineage and tracing for machine learning pipelines. ☆469 Updated 2 years ago
- What's in your data? Extract schema, statistics and entities from datasets ☆1,516 Updated this week
- detect demographic differences in the output of machine learning models or other assessments ☆318 Updated 5 years ago
- python library for automated dataset normalization ☆116 Updated 2 years ago
- Human-explainable AI. ☆526 Updated last week
- openclean - Data Cleaning and data profiling library for Python ☆80 Updated 3 years ago
- A Benchmark for Joint Data Cleaning and Machine Learning ☆49 Updated last year
- DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai) ☆205 Updated 3 years ago
- Library for Semi-Automated Data Science ☆343 Updated this week
- Visual Exploration of Automated Machine Learning with ATMSeer ☆169 Updated 2 years ago