HoloClean / holoclean
A Machine Learning System for Data Enrichment.
☆520 Updated last year
Alternatives and similar repositories for holoclean:
Users who are interested in holoclean are comparing it to the libraries listed below.
- ☆77 Updated 2 years ago
- More interactive weak supervision with FlyingSquid ☆315 Updated 4 years ago
- Python library for automated dataset normalization ☆114 Updated last year
- Flow with FlorDB 🌻 ☆155 Updated 2 months ago
- ☆189 Updated 10 months ago
- Python toolkit for automatic data quality checks ☆283 Updated 4 years ago
- Source code for several Metanome data profiling algorithms ☆53 Updated last year
- DeltaPy - Tabular Data Augmentation (by @firmai) ☆543 Updated last year
- A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels f… ☆505 Updated 3 weeks ago
- A Benchmark for Joint Data Cleaning and Machine Learning ☆47 Updated 10 months ago
- Data Analysis Baseline Library ☆726 Updated 4 months ago
- An open source, high scalability toolkit in Java for Entity Resolution. ☆218 Updated last year
- A collection of tutorials for Snorkel ☆395 Updated 5 months ago
- Python library for building highly effective data science workflows ☆950 Updated last year
- Inspect ML Pipelines in Python in the form of a DAG ☆70 Updated last year
- 🐳 The stupidly simple CLI workspace for your data warehouse. ☆726 Updated 2 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation… ☆165 Updated 2 months ago
- Bias Auditing & Fair ML Toolkit ☆715 Updated last month
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal… ☆239 Updated last month
- Coarse-grained lineage and tracing for machine learning pipelines. ☆469 Updated 2 years ago
- DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai) ☆205 Updated 3 years ago
- Implementation of statistical models to analyze time lagged conversions ☆261 Updated 11 months ago
- ☆96 Updated 5 years ago
- Hopsworks - Data-Intensive AI platform with a Feature Store ☆1,221 Updated 2 months ago
- Type System for Data Analysis in Python ☆211 Updated 2 months ago
- What's in your data? Extract schema, statistics and entities from datasets ☆1,477 Updated last month
- Generate and Visualize Data Lineage from query history ☆322 Updated last year
- Python package for performing Entity and Text Matching using Deep Learning. ☆587 Updated 10 months ago
- ☆17 Updated 9 years ago
- ☆265 Updated last year