LSH index for approximate set containment search
☆62Jun 27, 2022Updated 3 years ago
Alternatives and similar repositories for lshensemble
Users that are interested in lshensemble are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆20Apr 13, 2023Updated 3 years ago
- ☆27May 24, 2018Updated 8 years ago
- Minhash LSH in Golang☆27Sep 24, 2019Updated 6 years ago
- Go implementation of ntHash☆20Sep 16, 2021Updated 4 years ago
- ☆22Jan 3, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Locality Sensitive Hashing for Go (Multi-probe LSH, LSH Forest, basic LSH)☆108Jul 21, 2018Updated 7 years ago
- Minhash Index Extended to Knead Kmer Intersection☆11Mar 18, 2020Updated 6 years ago
- BottomK minwise hashing for streaming set similarity☆44Mar 15, 2019Updated 7 years ago
- A Jupyter notebook extension to centralize and manage data☆15Dec 22, 2022Updated 3 years ago
- Searching large collections of sequencing data with genome-scale queries☆18May 26, 2026Updated 2 weeks ago
- D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf☆21Nov 18, 2021Updated 4 years ago
- Resources for PVLDB 2023 submission☆28Aug 28, 2024Updated last year
- Histosketching Using Little Kmers☆57May 25, 2023Updated 3 years ago
- A tool facilitating matching columns across tabular datasets. It also serves as an experiment suite for state-of-the-art schema matching …☆117May 15, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MARVIS (Modality Adaptive Reasoning over VISualizations) is an 'everything predictor' powered by VLMs + embeddings☆18Apr 15, 2026Updated last month
- A Brand New LSH: The fly’s olfactory circuits algorithm☆11May 2, 2018Updated 8 years ago
- CGo wrapper package around the libtidy, the HTML tidy library☆22Nov 19, 2021Updated 4 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.☆21May 5, 2018Updated 8 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,928Apr 18, 2026Updated last month
- A graph-inspired data structure for determining likely chains of sequences from breadcrumbs of evidence☆17Jun 29, 2021Updated 4 years ago
- Efficient set similarity search algorithms implemented in Go☆35Aug 27, 2022Updated 3 years ago
- Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.☆22Dec 14, 2024Updated last year
- De Bruijn graph representation in low memory☆36Jul 6, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Confirming specific taxonomic groups within your samples.☆19Dec 21, 2020Updated 5 years ago
- Approximate Nearest Neighbor using the MRPT algorithm☆23Aug 22, 2018Updated 7 years ago
- A search engine for Open Data☆60Mar 15, 2023Updated 3 years ago
- ☆18Jul 9, 2018Updated 7 years ago
- Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus☆44May 12, 2025Updated last year
- A Go implementation of the strobemers (https://github.com/ksahlin/strobemers)☆14Apr 23, 2021Updated 5 years ago
- Adaptive version of KMV algorithm for cardinality estimation☆22May 5, 2019Updated 7 years ago
- ☆27Jan 31, 2019Updated 7 years ago
- google all pairs similarity search package, with swig bindings☆23Feb 26, 2015Updated 11 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 🔬 R package: Analysis of Large Affymetrix Microarray Data Sets☆12Dec 14, 2025Updated 5 months ago
- Fast approximation of similarity for sets of very different sizes☆20Mar 8, 2022Updated 4 years ago
- ☆15Jan 16, 2018Updated 8 years ago
- dmmclust is a package for clustering short texts, based on Yin and Wang (2014)☆26Dec 13, 2017Updated 8 years ago
- How to use Chinese font in Matplotlib, the complete guide.☆13May 19, 2018Updated 8 years ago
- A message queue for genomic surveillance☆20Oct 14, 2021Updated 4 years ago
- A Rust interface for the Succinct Data Structure Library.☆15Jan 24, 2022Updated 4 years ago