ekzhu / datasketchLinks
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
☆2,788Updated last year
Alternatives and similar repositories for datasketch
Users that are interested in datasketch are comparing it to the libraries listed below
Sorting:
- A fast Python implementation of locality sensitive hashing.☆669Updated 5 years ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆3,545Updated last week
- FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)☆1,154Updated last year
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive…☆770Updated 2 years ago
- Benchmarks of approximate nearest neighbor libraries in Python☆5,470Updated 4 months ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,499Updated 6 months ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆1,039Updated 4 months ago
- Example Python code for comparing documents using MinHash☆251Updated 6 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆291Updated 2 years ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.☆1,114Updated this week
- A system for quickly generating training data with weak supervision☆5,923Updated last year
- A high performance implementation of HDBSCAN clustering.☆3,010Updated 2 weeks ago
- Approximate Nearest Neighbor Search for Sparse Data in Python!☆920Updated 5 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,160Updated this week
- Learning embeddings for classification, retrieval and ranking.☆3,957Updated 2 years ago
- A Python nearest neighbor descent for approximate nearest neighbors☆949Updated last week
- A library implementing different string similarity and distance measures using Python.☆1,019Updated 2 years ago
- Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data