MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
☆2,916 · Apr 18, 2026 · Updated last month
Alternatives and similar repositories for datasketch
Users that are interested in datasketch are comparing it to the libraries listed below.
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents ☆291 · Jun 11, 2023 · Updated 2 years ago
- Weighted MinHash implementation on CUDA (multi-gpu). ☆122 · Nov 29, 2023 · Updated 2 years ago
- ☆1,270 · Jul 30, 2024 · Updated last year
- A fast Python implementation of locality sensitive hashing. ☆678 · Apr 30, 2020 · Updated 6 years ago
- All-in-one text de-duplication ☆759 · Mar 9, 2026 · Updated 2 months ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk ☆14,239 · Oct 29, 2025 · Updated 6 months ago
- Example Python code for comparing documents using MinHash ☆252 · Feb 11, 2019 · Updated 7 years ago
- FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing) ☆1,160 · Jun 1, 2024 · Updated last year
- LSH index for approximate set containment search ☆62 · Jun 27, 2022 · Updated 3 years ago
- All-pair set similarity search on millions of sets in Python and on a laptop ☆604 · Oct 11, 2022 · Updated 3 years ago
- A library for efficient similarity search and clustering of dense vectors. ☆40,061 · Updated this week
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me… ☆3,582 · Apr 13, 2026 · Updated last month
- A Python Implementation of Simhash Algorithm ☆1,037 · Mar 24, 2022 · Updated 4 years ago
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive… ☆772 · Feb 23, 2023 · Updated 3 years ago
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,667 · Jun 10, 2025 · Updated 11 months ago
- A fast Python implementation of locality sensitive hashing. ☆71 · Mar 13, 2015 · Updated 11 years ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets. ☆150 · Sep 4, 2024 · Updated last year
- Approximate Nearest Neighbor Search for Sparse Data in Python! ☆918 · Oct 2, 2020 · Updated 5 years ago
- State-of-the-Art Embeddings, Retrieval, and Reranking ☆18,669 · May 12, 2026 · Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,058 · May 6, 2026 · Updated last week
- Library for fast text representation and classification.