LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record linkage (and privacy-preserving record linkage) and similarity search tasks.
☆32Aug 30, 2022Updated 3 years ago
Alternatives and similar repositories for LSHDB
Users that are interested in LSHDB are comparing it to the libraries listed below
Sorting:
- Graph Traversal (BFS & DFS), Single Source Shortest Path, Minimum Spanning Tree, RB Trees, B-Trees☆15Dec 8, 2011Updated 14 years ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated 2 months ago
- Architecture and UX design of KAML-D☆14Apr 3, 2018Updated 7 years ago
- A template-based cluster provisioning system☆61Mar 4, 2023Updated 3 years ago
- ☆12Dec 5, 2015Updated 10 years ago
- Terraform module for Cloudera Manager☆11May 6, 2020Updated 5 years ago
- An expansive bundle of NiFi additions intended to be used for generating test data☆11Aug 6, 2023Updated 2 years ago
- Window-Based Hybrid CPU/GPU Stream Processing Engine☆42Nov 16, 2022Updated 3 years ago
- The OpenFISMA project is an open source application designed to reduce the complexity and automate the regulatory requirements of the Fed…☆10Apr 21, 2015Updated 10 years ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors☆11Apr 8, 2025Updated 11 months ago
- Scripts and documentation for Waffle Takeout - Waffle's on-premises solution.☆11Jan 4, 2017Updated 9 years ago
- Express.js middleware to support an LDP server built on MongoDB☆14Updated this week
- How I got cmusphinx's transcript alignment tool to work.☆25Jun 10, 2015Updated 10 years ago
- Clustering and Link Prediction Evaluation in R☆14Sep 23, 2023Updated 2 years ago
- ☆23Dec 4, 2023Updated 2 years ago
- NiFi Dynamic Script Executors☆15Jul 17, 2016Updated 9 years ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆19Mar 9, 2026Updated last week
- Miscellaneous for various things☆21Nov 19, 2024Updated last year
- ⌨️ typed.js R htmlwidgets☆15Dec 12, 2021Updated 4 years ago
- A simple command line interface to the datamade/dedupe library.☆43Dec 26, 2022Updated 3 years ago
- Convert text to speech and create voice narrations☆18Dec 11, 2025Updated 3 months ago
- PhipsBoot is a relocatable x86_64 bootloader for legacy boot written in Rust and assembly.☆14Mar 2, 2025Updated last year
- pluggable payments plugin for django-oscar☆12Dec 13, 2017Updated 8 years ago
- ☆11Dec 12, 2025Updated 3 months ago
- A collection of datasets and databases☆24May 16, 2018Updated 7 years ago
- Slides and sample code from presentations at our meetup.☆11Aug 13, 2024Updated last year
- Distributed Lisp interpreter in Erlang.☆11Dec 14, 2016Updated 9 years ago
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- ☆10Sep 25, 2020Updated 5 years ago
- This repository contains CROW, the Clerical Resolution Online Widget, an open-source project designed to help data linkers with their cle…☆11Mar 5, 2026Updated 2 weeks ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b…☆25Apr 14, 2023Updated 2 years ago
- An R package for blocking records for record linkage / data deduplication based on approximate nearest neighbours algorithms.☆14Mar 11, 2026Updated last week
- DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).☆13May 25, 2021Updated 4 years ago
- Fast, unopinionated, minimalist (fluent) web framework for Golo☆15Aug 8, 2015Updated 10 years ago
- 一个简易的正则表达式引擎!☆10Apr 9, 2017Updated 8 years ago
- Lightweight validation tool for checking function arguments and data analysis scripts.☆12Dec 24, 2024Updated last year
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- 👨🔧Jekyll integration with Google Workbox to create Service Worker automatically.☆14Feb 1, 2019Updated 7 years ago