dimkar121 / LSHDB
LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record linkage (and privacy-preserving record linkage) and similarity search tasks.
☆31Updated 2 years ago
Alternatives and similar repositories for LSHDB:
Users that are interested in LSHDB are comparing it to the libraries listed below
- Dynamic Distributed Dimensional Data Model☆42Updated last year
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago
- Demo application for GRADOOP operators☆23Updated 4 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- KnowledgeStore☆20Updated 7 years ago
- A system and a Java API for large-scale graph processing based on Google's Pregel☆64Updated 12 years ago
- A framework for scalable graph computing.☆147Updated 6 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- High-security graph database☆62Updated 2 years ago
- Temporal_Graph_library☆25Updated 6 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- ☆23Updated 5 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- *Experimental* GraphChi-DB graph database with computational capabilities☆79Updated 9 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- Exploration Library in Java☆12Updated last year
- Augustus is an open source system for building and scoring statistical models designed to work with data sets that are too large to fit i…☆43Updated 11 years ago
- ☆15Updated 7 years ago
- Scalable Graph Mining☆62Updated 2 years ago
- Mirror of Apache Stanbol (incubating)☆112Updated last year
- Scalable Optical Character Recognition with Apache NiFi and Tesseract☆32Updated 8 years ago
- A toolkit for clustering web pages based on various similarity measures.☆33Updated 3 years ago
- ☆33Updated 10 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- Java, Perl, Python, Javascript, Ruby, etc. examples to query alphasparql.bioontology.org☆34Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago