Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
☆15Dec 24, 2023Updated 2 years ago
Alternatives and similar repositories for lazo
Users that are interested in lazo are comparing it to the libraries listed below
Sorting:
- A Jupyter notebook extension to centralize and manage data☆15Dec 22, 2022Updated 3 years ago
- ☆78Mar 6, 2023Updated 3 years ago
- Efficient set similarity search algorithms implemented in Go☆35Aug 27, 2022Updated 3 years ago
- Python3 implementation of the Similarity Flooding algorithm (S. Melnik, H. Garcia-Molina, E. Rahm "Similarity Flooding: A Versatile Graph…☆12Jun 30, 2020Updated 5 years ago
- This repository provides the implementation of several well-know INDs discovery algorithms☆14Nov 5, 2019Updated 6 years ago
- An infinite canvas built for brainstorming.☆13Jul 31, 2024Updated last year
- An idiomatic Rust wrapper for the V8 Javascript engine☆12Sep 7, 2018Updated 7 years ago
- MinHash implementation in Python☆12Aug 24, 2024Updated last year
- Web app for streamhut☆16Jul 12, 2020Updated 5 years ago
- ☆22Jan 3, 2023Updated 3 years ago
- Benchmarking Machine Learning Model Inference in Data Streaming Solutions☆10Jun 12, 2024Updated last year
- Master thesis - reproducing state-of-the-art schema matching algorithms☆14Jul 6, 2023Updated 2 years ago
- mechanical-elephant.com☆11Jan 29, 2016Updated 10 years ago
- Implementing MISON by Microsoft in C++ as a test☆21Mar 1, 2018Updated 8 years ago
- An implementation of Kensler's hashed permutation algorithm☆17Jan 31, 2025Updated last year
- Pattern-based table discovery in Open Data CSV files☆25Dec 8, 2022Updated 3 years ago
- Implementation of algorithms proposed by [Huang and Kasiviswanathan]☆16Jul 27, 2016Updated 9 years ago
- ☆13Feb 11, 2019Updated 7 years ago
- ☆18Dec 26, 2025Updated 2 months ago
- Benchmarking suite for the Web-Scale Data Management course using Locust☆14Aug 9, 2024Updated last year
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- [ESEC/FSE'23] Hue: A User-Adaptive Parser for Hybrid Logs☆10Aug 24, 2023Updated 2 years ago
- Portals is a framework for stateful serverless apps, unifying dataflow streaming with actors☆20Nov 15, 2023Updated 2 years ago
- Experimenting with database engines and related structures, algorithms and concepts.☆15Feb 15, 2026Updated last month
- ☆14Jan 12, 2021Updated 5 years ago
- A simple implementation of DyClee in Python: a DYnamic CLustering algorithm for tracking Evolving Environments.☆10Jun 17, 2024Updated last year
- Fast and accurate set similarity estimation via containment min hash☆42Jul 19, 2024Updated last year
- The Org-mode Parser for Python☆12Jan 28, 2017Updated 9 years ago
- Automate creating resilient, disposable, secure and agile monitoring infrastructure for Blue Teams.☆24Oct 31, 2022Updated 3 years ago
- Learning Rate Finder using Tensorflow Dataset☆10Jul 24, 2020Updated 5 years ago
- ☆26May 24, 2018Updated 7 years ago
- InfiDraw is a procedural drawing tool, which provides lightweight infinite canvas.☆14Oct 11, 2023Updated 2 years ago
- My master thesis — work on this has begun in February 2010, and continued until June 2011! Subject: "Web Performance Optimization: Analy…☆12Jul 25, 2011Updated 14 years ago
- Domain Generation Algorithms research papers, datasets and code☆15May 17, 2020Updated 5 years ago
- A monolithic index that supports worst-case optimal joins (WCOJ) by providing all collation orders in a single redundancy eliminating dat…☆16Sep 18, 2025Updated 6 months ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation☆25Jan 1, 2018Updated 8 years ago
- Tool to transform an ontology diagram into OWL code.☆39Apr 11, 2025Updated 11 months ago
- Talend Administration Center (TAC) Docker build files☆12Jun 23, 2016Updated 9 years ago
- ☆14May 6, 2018Updated 7 years ago