LSH index for approximate set containment search
☆61Jun 27, 2022Updated 3 years ago
Alternatives and similar repositories for lshensemble
Users that are interested in lshensemble are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆19Apr 13, 2023Updated 2 years ago
- ☆26May 24, 2018Updated 7 years ago
- Minhash LSH in Golang☆27Sep 24, 2019Updated 6 years ago
- Locality Sensitive Hashing for Go (Multi-probe LSH, LSH Forest, basic LSH)☆108Jul 21, 2018Updated 7 years ago
- Minhash Index Extended to Knead Kmer Intersection☆11Mar 18, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆78Mar 6, 2023Updated 3 years ago
- BottomK minwise hashing for streaming set similarity☆44Mar 15, 2019Updated 7 years ago
- large-memory key-value pair store for Python☆50May 26, 2013Updated 12 years ago
- A Jupyter notebook extension to centralize and manage data☆15Dec 22, 2022Updated 3 years ago
- Searching large collections of sequencing data with genome-scale queries☆17Feb 5, 2026Updated last month
- Flexible omics pipeline☆17Oct 16, 2025Updated 5 months ago
- D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf☆21Nov 18, 2021Updated 4 years ago
- Resources for PVLDB 2023 submission☆27Aug 28, 2024Updated last year
- Histosketching Using Little Kmers☆57May 25, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching …☆105Updated this week
- A Brand New LSH: The fly’s olfactory circuits algorithm☆11May 2, 2018Updated 7 years ago
- CGo wrapper package around the libtidy, the HTML tidy library☆22Nov 19, 2021Updated 4 years ago
- ArcheType uses LLMs to automatically assign custom labels to your tabular data☆19May 21, 2025Updated 10 months ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW☆2,892Updated this week
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks.☆23Mar 31, 2025Updated 11 months ago
- A graph-inspired data structure for determining likely chains of sequences from breadcrumbs of evidence☆17Jun 29, 2021Updated 4 years ago
- Efficient set similarity search algorithms implemented in Go☆35Aug 27, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.☆22Dec 14, 2024Updated last year
- De Bruijn graph representation in low memory☆35Jul 6, 2024Updated last year
- A database for signatures of public genomic sources☆18Jan 1, 2026Updated 2 months ago
- A counter data structure that knows when to start estimating to save space☆34Oct 9, 2017Updated 8 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆15Dec 24, 2023Updated 2 years ago
- Approximate Nearest Neighbor using the MRPT algorithm☆23Aug 22, 2018Updated 7 years ago
- Streaming sequence classification with web services ✓📌☆19Dec 8, 2022Updated 3 years ago
- A search engine for Open Data☆59Mar 15, 2023Updated 3 years ago
- ☆10Jun 16, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Build sourmash databases for genbank.☆11May 4, 2023Updated 2 years ago
- ☆18Jul 9, 2018Updated 7 years ago
- A Go implementation of the strobemers (https://github.com/ksahlin/strobemers)☆14Apr 23, 2021Updated 4 years ago
- Adaptive version of KMV algorithm for cardinality estimation☆22May 5, 2019Updated 6 years ago
- ☆27Jan 31, 2019Updated 7 years ago
- google all pairs similarity search package, with swig bindings☆23Feb 26, 2015Updated 11 years ago
- 🔬 R package: Analysis of Large Affymetrix Microarray Data Sets☆12Dec 14, 2025Updated 3 months ago