dimkar121 / LSHDB
LSHDB is a parallel and distributed data engine that relies on Locality-Sensitive Hashing and NoSQL systems to perform record linkage (including privacy-preserving record linkage) and similarity search tasks.
☆31 Updated 3 years ago
Alternatives and similar repositories for LSHDB
Users who are interested in LSHDB are comparing it to the libraries listed below.
- Dynamic Distributed Dimensional Data Model ☆42 Updated last year
- How to spot first stories on Twitter using Storm. ☆124 Updated 2 years ago
- Blazegraph Tinkerpop3 Implementation ☆62 Updated 5 years ago
- Demo application for GRADOOP operators ☆24 Updated 5 years ago
- A framework for scalable graph computing. ☆152 Updated 7 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine ☆168 Updated 4 years ago
- GraphChi's Java version ☆239 Updated 2 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms ☆80 Updated 6 months ago
- A web based data mining workflow platform with real-time analysis capabilities ☆49 Updated 3 years ago
- Representing and mining dynamical social networks in Neo4j ☆117 Updated 4 years ago
- Scalable Graph Mining ☆63 Updated 3 years ago
- The Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning ☆134 Updated 4 years ago
- ☆92 Updated 10 years ago
- *Experimental* GraphChi-DB graph database with computational capabilities ☆79 Updated 10 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 6 years ago
- zenvisage's foundational framework ☆70 Updated 3 years ago
- Provides a SQL interface to your TinkerPop enabled graph db ☆74 Updated 2 years ago
- Query Spark in real time and visualise the result as a graph. ☆24 Updated 8 years ago
- SociaLite: query language for large-scale graph analysis and data mining ☆111 Updated 9 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text. ☆34 Updated 2 years ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark ☆41 Updated 5 years ago
- Fast in-memory graph structure, powering Gephi ☆79 Updated last week
- High-security graph database ☆64 Updated 3 years ago
- Mirror of Apache Stanbol (incubating) ☆116 Updated last year
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆92 Updated 10 years ago
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It… ☆202 Updated 5 years ago
- Java port of TLSH (Trend Micro Locality Sensitive Hash) ☆24 Updated 4 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆116 Updated 4 years ago
- Distributed Temporal Graph Analytics with Apache Flink ☆252 Updated 3 weeks ago
- ☆70 Updated 7 years ago