dimkar121 / LSHDB
LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record linkage (and privacy-preserving record linkage) and similarity search tasks.
☆31Updated 2 years ago
Alternatives and similar repositories for LSHDB:
Users that are interested in LSHDB are comparing it to the libraries listed below
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Dynamic Distributed Dimensional Data Model☆41Updated 11 months ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- ☆71Updated 7 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A framework for scalable graph computing.☆147Updated 6 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that…☆32Updated 9 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆78Updated 11 months ago
- Vizlinc☆14Updated 9 years ago
- Scalable Graph Mining☆61Updated 2 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Java port of TLSH (Trend Micro Locality Sensitive Hash)☆20Updated 3 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- pythonic access to fastbit☆26Updated 6 years ago
- Spark algorithms for building k-nn graphs☆42Updated 6 years ago
- Pattern-of-Behavior Search Tool☆11Updated 2 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- A java library for stored queries☆16Updated last year
- Exploration Library in Java☆12Updated last year
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features.☆13Updated 7 years ago
- Demo application for GRADOOP operators☆23Updated 4 years ago
- The Chronos versioning project aims to provide easy-to-use and reliable versioned data storage.☆52Updated 4 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 7 years ago
- ☆33Updated 10 years ago
- TensorDB: In-Database Tensor Manipulation with Tensor-Relational Query Plans☆20Updated 10 years ago