dimkar121 / LSHDB
LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record linkage (and privacy-preserving record linkage) and similarity search tasks.
☆31Updated 2 years ago
Alternatives and similar repositories for LSHDB:
Users that are interested in LSHDB are comparing it to the libraries listed below
- Dynamic Distributed Dimensional Data Model☆42Updated last year
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆78Updated last year
- KnowledgeStore☆20Updated 7 years ago
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- A framework for scalable graph computing.☆147Updated 6 years ago
- ☆33Updated 10 years ago
- phData Pulse application log aggregation and monitoring☆13Updated 5 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- Fast in-memory graph structure, powering Gephi☆75Updated 6 months ago
- Apache NiFi NLP Processor☆18Updated last year
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size.☆25Updated 6 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features.☆13Updated 7 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- SociaLite: query language for large-scale graph analysis and data mining☆109Updated 8 years ago
- Machine Learning for Cascading☆81Updated 9 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- TensorDB: In-Database Tensor Manipulation with Tensor-Relational Query Plans☆20Updated 10 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 9 years ago
- Exploration Library in Java☆12Updated last year
- Distributed Matrix Library☆71Updated 8 years ago
- Benchmarking various graph databases, engines, datastructures, and data stores.☆37Updated 11 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- ByteBuffer collection classes for java and jvm-based languages.☆33Updated 7 years ago
- Library of graph algorithms for Apache Giraph.☆8Updated 9 years ago
- ☆41Updated 7 years ago
- High-security graph database☆62Updated 2 years ago