YannBrrd / elasticsearch-entity-resolution
Elasticsearch entity resolution plugin based on Duke
☆210 · Updated 4 years ago
Alternatives and similar repositories for elasticsearch-entity-resolution
Users who are interested in elasticsearch-entity-resolution compare it to the libraries listed below.
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 · Updated 9 years ago
- Duke is a fast and flexible deduplication engine written in Java ☆621 · Updated last year
- TinkerPop 3 implementation on Elasticsearch backend ☆70 · Updated 9 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark. ☆128 · Updated 9 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark. ☆382 · Updated 2 years ago
- Text classification using Naive Bayes and Elasticsearch ☆154 · Updated 8 years ago
- ☆111 · Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 · Updated 5 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud. ☆90 · Updated 6 years ago
- ☆92 · Updated 9 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop. ☆281 · Updated 7 years ago
- A text tagger based on Lucene / Solr, using FST technology ☆176 · Updated last year
- Graph Processing Algorithms on top of Neo4j ☆39 · Updated 7 years ago
- A platform for real-time streaming search ☆103 · Updated 9 years ago
- This project combines Apache Spark and Elasticsearch to enable mining & prediction for Elasticsearch. ☆211 · Updated 10 years ago
- Data Integration Graph ☆206 · Updated 6 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector ☆252 · Updated 7 years ago
- Scalable query engine for web scraping/data mashup/acceptance QA, powered by Apache Spark ☆142 · Updated 2 weeks ago
- Carrot2 plugin for ElasticSearch ☆291 · Updated 2 years ago
- Beyond Piwik Analytics with Scala and Apache Spark ☆46 · Updated 10 years ago
- Elasticsearch Latent Semantic Indexing experimentation ☆33 · Updated 5 years ago
- Graphify is a Neo4j unmanaged extension used for document and text classification using graph-based hierarchical pattern recognition. ☆380 · Updated 5 years ago
- Elasticsearch Index Termlist ☆117 · Updated 6 years ago
- A bundle of useful Elasticsearch plugins ☆110 · Updated last year
- Analytic UIMA pipelines using Spark ☆23 · Updated 9 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface. ☆101 · Updated 7 years ago
- Dice Solr Plugins from Simon Hughes Dice.com ☆87 · Updated 4 years ago
- (deprecated) High performance Elasticsearch percolator ☆47 · Updated 5 years ago
- A Java library for stored queries ☆375 · Updated 2 years ago
- Integration of Samza and Luwak ☆99 · Updated 10 years ago