weblyzard / nilsimsa
A Java library for computing and comparing Nilsimsa string similarity hashes.
☆10Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for nilsimsa
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 9 years ago
- Lucene Auto Phrase TokenFilter implementation☆59Updated 6 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- This module contains an implementation of the Nilsimsa locality-sensitive hashing algorithm in Java.☆18Updated 5 years ago
- Graph Processing Algorithms on top of Neo4j☆39Updated 7 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 4 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆70Updated 4 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features.☆13Updated 6 years ago
- Run cassandra inside a java project without bring server deps into client classpath☆33Updated 5 years ago
- Document clustering based on Latent Semantic Analysis☆96Updated 14 years ago
- Sample migration from Titan 0.5.4 to Titan 1.0.0☆17Updated 8 years ago
- ☆20Updated 8 years ago
- Storm / Solr Integration☆19Updated 9 months ago
- Storm Cassandra Integration☆179Updated 11 months ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- (deprecated) Please use new nlp4l instead.☆66Updated 8 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 8 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆281Updated 6 years ago
- Lucene based indexing in Cassandra☆61Updated 8 years ago
- ☆71Updated 6 years ago
- NLP Utilities in Java☆43Updated last year
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- ☆18Updated 8 years ago
- SolrCloud HAFT is a High Availability and Fault Tolerant Framework for SolrCloud☆30Updated 8 years ago
- Keyword extraction package for Spark.☆12Updated 7 years ago
- GraphAware Timer-Driven Runtime Module that executes PageRank-like algorithm on the graph☆26Updated 7 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 7 years ago