dimkar121 / LSHDBLinks
LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record linkage (and privacy-preserving record linkage) and similarity search tasks.
☆31Updated 3 years ago
Alternatives and similar repositories for LSHDB
Users that are interested in LSHDB are comparing it to the libraries listed below
Sorting:
- How to spot first stories on Twitter using Storm.☆124Updated last year
- Dynamic Distributed Dimensional Data Model☆42Updated last year
- Blazegraph Tinkerpop3 Implementation☆62Updated 5 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- ☆92Updated 10 years ago
- A framework for scalable graph computing.☆150Updated 7 years ago
- The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning☆134Updated 4 years ago
- GraphChi's Java version☆238Updated last year
- Scalable Graph Mining☆63Updated 3 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 4 years ago
- Demo application for GRADOOP operators☆24Updated 5 years ago
- Representing and mining dynamical social networks in Neo4j☆116Updated 3 years ago
- A Generalized Data Cleaning System☆50Updated 9 years ago
- The Chronos versioning project aims to provide easy-to-use and reliable versioned data storage.☆52Updated 5 years ago
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It…☆201Updated 5 years ago
- Fast in-memory graph structure, powering Gephi☆74Updated 3 weeks ago
- `Slib` is a JAVA library dedicated to semantic data mining based on texts and/or ontology processing. The library is composed of various …☆84Updated 2 years ago
- Gremlin++: A C++ Interpreter for the Gremlin language.☆19Updated 10 months ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆35Updated 6 years ago
- zenvisage's foundational framework☆69Updated 2 years ago
- Distributed Temporal Graph Analytics with Apache Flink☆249Updated this week
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆25Updated 6 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆79Updated 3 months ago
- Storm / Solr Integration☆19Updated last year
- GPU Acceleration for Apache Spark☆34Updated 10 years ago
- Graph Analytics Engine☆260Updated 11 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆45Updated 6 years ago
- A scala-based feature generation and modeling framework☆61Updated 7 years ago
- SociaLite: query language for large-scale graph analysis and data mining☆110Updated 9 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features.☆15Updated 7 years ago