weblyzard / nilsimsaLinks
A Java library for computing and comparing Nilsimsa string similarity hashes.
☆11Updated 3 years ago
Alternatives and similar repositories for nilsimsa
Users that are interested in nilsimsa are comparing it to the libraries listed below
Sorting:
- Sample migration from Titan 0.5.4 to Titan 1.0.0☆17Updated 9 years ago
- This module contains an implementation of the Nilsimsa locality-sensitive hashing algorithm in Java.☆18Updated 6 years ago
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 9 years ago
- Lucene Auto Phrase TokenFilter implementation☆59Updated 7 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆81Updated 2 months ago
- Elasticsearch plugin for b-bit minhash algorism☆63Updated last year
- Topic Modeling with LDA in Scala and Spark☆31Updated 7 years ago
- Scala stuff☆18Updated 6 years ago
- Provides a SQL interface to your TinkerPop enabled graph db☆75Updated 2 years ago
- command line tool for Apache Lucene☆163Updated 2 months ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆27Updated 6 years ago
- A new object-graph-wrapper for the Tinkerpop 3 graph stack.☆40Updated 4 years ago
- Lucene based indexing in Cassandra☆61Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform☆72Updated 2 months ago
- Browser-driven explorer for lucene indexes☆74Updated 4 years ago
- ☆76Updated 8 years ago
- Query preprocessor for Java-based search engines (Querqy Core and Lucene implementation)☆187Updated last week
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It…☆202Updated 5 years ago
- Kafka Connect Cassandra Connector. This project includes source/sink connectors for Cassandra to/from Kafka.☆78Updated 9 years ago
- A scala-based feature generation and modeling framework☆61Updated 7 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆283Updated 7 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Updated 10 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- TinkerPop3 Graph Structure Implementation for OrientDB☆94Updated 3 weeks ago
- Integration of Samza and Luwak☆100Updated 10 years ago
- something to help you spark☆64Updated 6 years ago
- Topic Modeling on Apache Spark☆94Updated 6 years ago