dimkar121 / LSHDB
LSHDB is a parallel and distributed data engine that relies on Locality-Sensitive Hashing and NoSQL systems to perform record linkage (including privacy-preserving record linkage) and similarity search tasks.
☆31 Updated 2 years ago
Alternatives and similar repositories for LSHDB
Users who are interested in LSHDB are comparing it to the libraries listed below.
- Dynamic Distributed Dimensional Data Model ☆43 Updated last year
- Blazegraph Tinkerpop3 Implementation ☆61 Updated 4 years ago
- KnowledgeStore ☆20 Updated 7 years ago
- Ductile DB is a graph database based on Hadoop/HBase which provides a vast set of features. ☆15 Updated 7 years ago
- Solr Dictionary Annotator (Microservice for Spark) ☆71 Updated 5 years ago
- A web based data mining workflow platform with real-time analysis capabilities ☆49 Updated 2 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases ☆13 Updated 9 years ago
- Real-time query spark and visualise it as graph. ☆24 Updated 7 years ago
- Demo application for GRADOOP operators ☆23 Updated 5 years ago
- ☆20 Updated 8 years ago
- ☆20 Updated 8 years ago
- ☆23 Updated 5 years ago
- Fast and robust NLP components implemented in Java. ☆52 Updated 4 years ago
- RDF store on a cloud-based architecture (previously on https://code.google.com/p/cumulusrdf) ☆31 Updated 9 years ago
- A toolkit for clustering web pages based on various similarity measures. ☆33 Updated 3 years ago
- Pattern-of-Behavior Search Tool ☆11 Updated 3 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints. ☆25 Updated 6 years ago
- How to spot first stories on Twitter using Storm. ☆125 Updated last year
- A framework for scalable graph computing. ☆147 Updated 7 years ago
- pythonic access to fastbit ☆26 Updated 6 years ago
- Library for building reproducible data pipelines to support experimentation ☆20 Updated 9 years ago
- A framework to benchmark different graph databases, based on generated data from customizable schema, distribution, and size. ☆25 Updated 6 years ago
- High-security graph database ☆63 Updated 2 years ago
- A Stanford CoreNLP server, with example clients, using Apache Thrift. ☆47 Updated 6 years ago
- Stanford Entity-Resolution Framework ☆24 Updated 7 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,... ☆34 Updated 6 years ago
- The first Open Source document analysis platform ☆65 Updated 3 years ago
- ☆19 Updated 7 years ago
- A distributed in-memory key-value storage for billions of small objects. ☆24 Updated 5 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that… ☆32 Updated 10 years ago