HoloClean / HoloClean-Legacy-deprecated
A Machine Learning System for Data Enrichment.
☆75Updated 6 years ago
Alternatives and similar repositories for HoloClean-Legacy-deprecated:
Users that are interested in HoloClean-Legacy-deprecated are comparing it to the libraries listed below
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 5 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm.☆23Updated 7 years ago
- REx: Relation Extraction. Modernized re-write of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, a…☆22Updated 6 years ago
- ☆92Updated 9 years ago
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- ☆39Updated 8 years ago
- Numba-based version of DimmWitted Gibbs sampler☆45Updated 6 years ago
- ☆52Updated 7 years ago
- An open relation extraction system☆46Updated 3 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- The Berkeley Entity Resolution System jointly solves the problems of named entity recognition, coreference resolution, and entity linking…☆185Updated 5 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆63Updated 10 months ago
- ☆75Updated last year
- deep entity resolution lite version☆11Updated 5 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- Automatically labeling training data☆105Updated 6 years ago
- Tools and data for creating DBpedia Spotlight models.☆37Updated 3 years ago
- ☆110Updated 7 years ago
- A collection of simple tutorials for using Fonduer☆99Updated 4 years ago
- A Generalized Data Cleaning System☆49Updated 8 years ago
- Implements dictionary-based entity extraction as described in the FAERIE paper http://dbgroup.cs.tsinghua.edu.cn/dd/papers/sigmod2011-fae…☆9Updated 7 years ago
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- Code for KGI project☆26Updated 7 years ago
- Distributed Matrix Library☆70Updated 8 years ago
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31Updated 3 years ago
- Question Parsing module for the PPP using a grammatical approch☆33Updated 7 years ago
- PyMC version 3 (PyMC 2 is in branch 2.3)☆27Updated 10 years ago
- Analytic UIMA pipelines using Spark☆23Updated 9 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago