YannBrrd / elasticsearch-entity-resolution
Elasticsearch entity resolution plugin based on Duke
☆210Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for elasticsearch-entity-resolution
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- A bundle of useful Elasticsearch plugins☆110Updated 7 months ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated 11 months ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆281Updated 6 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆251Updated 6 years ago
- Duke is a fast and flexible deduplication engine written in Java☆615Updated last year
- Solr Dictionary Annotator (Microservice for Spark)☆70Updated 4 years ago
- Graphify is a Neo4j unmanaged extension used for document and text classification using graph-based hierarchical pattern recognition.☆382Updated 4 years ago
- ☆92Updated 9 years ago
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆128Updated 8 years ago
- Dice Solr Plugins from Simon Hughes Dice.com☆87Updated 3 years ago
- Carrot2 plugin for ElasticSearch☆292Updated last year
- Search a single field with different query time analyzers in Solr☆25Updated 4 years ago
- Elasticsearch Index Termlist☆117Updated 5 years ago
- A platform for real-time streaming search☆103Updated 8 years ago
- This project combines Apache Spark and Elasticsearch to enable mining & prediction for Elasticsearch.☆209Updated 10 years ago
- A java library for stored queries☆374Updated last year
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆269Updated 2 years ago
- ☆24Updated 9 years ago
- Juicer is a web API for extracting text, meta data and named entities from HTML "article" type pages.☆60Updated 9 years ago
- ☆110Updated 7 years ago
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 9 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆382Updated last year
- (deprecated) High performance Elasticsearch percolator☆46Updated 5 years ago
- Browser-driven explorer for lucene indexes☆74Updated 3 years ago
- Spark RDD with Lucene's query and entity linkage capabilities☆124Updated 2 weeks ago