drangons / entity_resolution_spark
Collection of some algorithms for entity resolution
☆28 Updated 9 years ago
Alternatives and similar repositories for entity_resolution_spark:
Users who are interested in entity_resolution_spark compare it to the libraries listed below.
- SparkER: an Entity Resolution framework for Apache Spark ☆63 Updated 10 months ago
- deep entity resolution lite version ☆11 Updated 5 years ago
- Tutorial code and data for the entity resolution workshops. ☆43 Updated 9 years ago
- DBpedia.org RDF to CSV for import into Neo4j ☆51 Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
- A Machine Learning System for Data Enrichment. ☆75 Updated 6 years ago
- NOUS: Construction, Querying and Reasoning with Knowledge Graphs ☆71 Updated 2 years ago
- End-to-End Deep Entity Resolution ☆31 Updated 3 years ago
- just a prototype ☆32 Updated 9 years ago
- An open relation extraction system ☆46 Updated 3 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Word2Vec models with Twitter data using Spark. Blog: ☆65 Updated 6 years ago
- Analytic UIMA pipelines using Spark ☆23 Updated 9 years ago
- ☆11 Updated 7 years ago
- Topic Modeling on Apache Spark ☆94 Updated 5 years ago
- Semantic Preserving Embeddings for Generalized Graphs ☆31 Updated 6 years ago
- Record Linkage ToolKit (Find and link entities) ☆108 Updated last year
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling ☆28 Updated last year
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance. ☆40 Updated 8 years ago
- Stanford Pattern-based Information Extraction and Diagnostics -- Visualization ☆93 Updated 10 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ☆85 Updated 3 years ago
- ☆15 Updated 2 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit" ☆110 Updated 10 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base. ☆21 Updated 6 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆46 Updated 6 years ago
- Graph Processing Algorithms on top of Neo4j ☆39 Updated 7 years ago
- Building Annoy Index on Apache Spark ☆72 Updated 4 years ago
- ☆54 Updated 6 years ago
- ☆75 Updated last year
- Stanford Entity-Resolution Framework ☆23 Updated 6 years ago