drangons / entity_resolution_spark
Collection of some algorithms for entity resolution
☆28 Updated 9 years ago
Alternatives and similar repositories for entity_resolution_spark:
Users who are interested in entity_resolution_spark are comparing it to the libraries listed below.
- SparkER: an Entity Resolution framework for Apache Spark ☆64 Updated last year
- Tutorial code and data for the entity resolution workshops. ☆45 Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
- Topic Modeling on Apache Spark ☆95 Updated 6 years ago
- deep entity resolution lite version ☆11 Updated 5 years ago
- NOUS: Construction, Querying and Reasoning with Knowledge Graphs ☆71 Updated 2 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance. ☆40 Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog: ☆65 Updated 6 years ago
- just a prototype ☆32 Updated 9 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark. ☆16 Updated 9 years ago
- A Machine Learning System for Data Enrichment. ☆75 Updated 6 years ago
- A Spark-based LexRank extractive summarizer for text documents ☆19 Updated 9 years ago
- An implementation of Markov Clustering algorithm for Spark in Scala ☆34 Updated 7 years ago
- SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX. ☆152 Updated 4 years ago
- An example of Spark and GraphX with Twitter as sample ☆19 Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface. ☆101 Updated 7 years ago
- Semantic Preserving Embeddings for Generalized Graphs ☆31 Updated 6 years ago
- Semantic Entity Retrieval Toolkit ☆109 Updated 7 years ago
- An open relation extraction system ☆46 Updated 3 years ago
- Locality Sensitive Hashing for Apache Spark ☆87 Updated 3 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31 Updated 9 years ago
- Knowledge extraction from web data ☆92 Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- DBpedia.org RDF to CSV for import into Neo4j ☆52 Updated 10 years ago
- ElasticSearch Prediction Generator and Plugin ☆22 Updated 9 years ago
- Spark algorithms for building k-nn graphs ☆42 Updated 6 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries. ☆65 Updated 8 years ago
- In-database parallel grid-search for XGBoost on Greenplum ☆15 Updated 7 years ago
- ☆16 Updated 4 years ago
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, … ☆22 Updated 3 years ago