HoloClean / holoclean
A Machine Learning System for Data Enrichment.
☆518Updated last year
Related projects ⓘ
Alternatives and complementary repositories for holoclean
- What's in your data? Extract schema, statistics and entities from datasets☆1,434Updated last week
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f…☆498Updated 2 weeks ago
- More interactive weak supervision with FlyingSquid☆315Updated 4 years ago
- Type System for Data Analysis in Python☆209Updated 3 months ago
- ☆185Updated 5 months ago
- 🐳 The stupidly simple CLI workspace for your data warehouse.☆725Updated last year
- A model-agnostic visual debugging tool for machine learning☆1,650Updated last year
- Generate and Visualize Data Lineage from query history☆311Updated last year
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆230Updated this week
- Random dataframe and database table generator☆303Updated 3 years ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆497Updated 2 months ago
- python automatic data quality check toolkit☆285Updated 4 years ago
- DeltaPy - Tabular Data Augmentation (by @firmai)☆537Updated last year
- Python library for building highly effective data science workflows☆952Updated last year
- Source code/webpage/demos for the What-If Tool☆920Updated 2 months ago
- Data Analysis Baseline Library☆724Updated 3 months ago
- Python package for performing Entity and Text Matching using Deep Learning.☆568Updated 5 months ago
- 🌻 Flow with FlorDB☆151Updated 2 months ago
- Implementation of statistical models to analyze time lagged conversions☆259Updated 6 months ago
- The complete graph data science platform☆139Updated last week
- Joblib Apache Spark Backend☆242Updated 3 months ago
- MLeap: Deploy ML Pipelines to Production☆1,504Updated last week
- MacroBase: A Search Engine for Fast Data☆661Updated last year
- ☆74Updated last year
- Synthetic data generators for structured and unstructured text, featuring differentially private learning.☆596Updated this week
- Tool to automate data quality checks on data pipelines☆249Updated 2 years ago
- machine learning with logical rules in Python☆625Updated 9 months ago
- Interpret Community extends Interpret repository with additional interpretability techniques and utility functions to handle real-world d…☆421Updated 5 months ago
- A collection of tutorials for Snorkel☆392Updated this week