dimkar121 / LSHDB
LSHDB is a parallel and distributed data engine that relies on Locality-Sensitive Hashing and NoSQL systems to perform record linkage (including privacy-preserving record linkage) and similarity search tasks.
☆31 · Updated 3 years ago
Alternatives and similar repositories for LSHDB
Users who are interested in LSHDB are comparing it to the libraries listed below.
- Dynamic Distributed Dimensional Data Model ☆43 · Updated last year
- ☆92 · Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 · Updated 5 years ago
- Blazegraph Tinkerpop3 Implementation ☆62 · Updated 4 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms ☆79 · Updated last month
- Beyond Piwik Analytics with Scala and Apache Spark ☆46 · Updated 10 years ago
- How to spot first stories on Twitter using Storm. ☆124 · Updated last year
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It… ☆201 · Updated 5 years ago
- Provides a SQL interface to your TinkerPop enabled graph db ☆75 · Updated 2 years ago
- GraphChi's Java version ☆238 · Updated last year
- Real-time query spark and visualise it as graph. ☆24 · Updated 8 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆93 · Updated 9 years ago
- Demo application for GRADOOP operators ☆24 · Updated 5 years ago
- A framework for scalable graph computing. ☆150 · Updated 7 years ago
- GPU Acceleration for Apache Spark ☆34 · Updated 10 years ago
- Scalable Graph Mining ☆63 · Updated 2 years ago
- zenvisage's foundational framework ☆69 · Updated 2 years ago
- The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning ☆134 · Updated 4 years ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark ☆41 · Updated 5 years ago
- Fast in-memory graph structure, powering Gephi ☆74 · Updated this week
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. ☆34 · Updated 2 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine ☆46 · Updated 6 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that… ☆32 · Updated 10 years ago
- Java Matrix Benchmark is a tool for evaluating Java linear algebra libraries for speed, stability, and memory usage. ☆61 · Updated 2 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine ☆167 · Updated 4 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,... ☆34 · Updated 6 years ago
- A scala-based feature generation and modeling framework ☆61 · Updated 7 years ago
- Java port of TLSH (Trend Micro Locality Sensitive Hash) ☆21 · Updated 4 years ago
- ☆24 · Updated 9 years ago
- Apache OpenNLP Sandbox ☆44 · Updated last week