ekzhu / datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
☆2,864 Updated last week
Alternatives and similar repositories for datasketch
Users who are interested in datasketch are comparing it to the libraries listed below.
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me… ☆3,566 Updated 2 weeks ago
- FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing) ☆1,156 Updated last year
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,585 Updated 7 months ago
- A fast Python implementation of locality sensitive hashing. ☆674 Updated 5 years ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm ☆1,077 Updated last month
- Header-only C++/python library for fast approximate nearest neighbors ☆5,074 Updated 4 months ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents ☆292 Updated 2 years ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ☆1,123 Updated last month
- A Collection of BM25 Algorithms in Python ☆1,296 Updated last year
- Learning embeddings for classification, retrieval and ranking. ☆3,958 Updated 3 years ago
- Learning to Rank in TensorFlow ☆2,780 Updated last year
- A fast, efficient universal vector embedding utility package. ☆1,652 Updated 2 years ago
- A system for quickly generating training data with weak supervision ☆5,937 Updated last year
- Fast Python Collaborative Filtering for Implicit Feedback Datasets ☆3,762 Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,875 Updated 3 weeks ago
- Simple web service providing a word embedding model ☆1,445 Updated 2 years ago
- Port of Google's language-detection library to Python. ☆1,870 Updated 10 months ago
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,187 Updated last month
- A python tool for evaluating the quality of sentence embeddings. ☆2,107 Updated last year
- All-pair set similarity search on millions of sets in Python and on a laptop ☆604 Updated 3 years ago
- A large annotated semantic parsing corpus for developing natural language interfaces. ☆1,797 Updated 3 months ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk ☆14,135 Updated 3 months ago
- Example Python code for comparing documents using MinHash ☆251 Updated 6 years ago
- Stand-alone language identification system ☆2,450 Updated 6 years ago
- Python library implementing a trie data structure. ☆824 Updated 4 years ago
- 🦆 Contextually-keyed word vectors ☆1,668 Updated 9 months ago
- A library implementing different string similarity and distance measures using Python. ☆1,020 Updated 3 years ago
- General purpose unsupervised sentence representations ☆1,208 Updated 3 years ago
- A Python nearest neighbor descent for approximate nearest neighbors ☆958 Updated 3 weeks ago
- GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network. ☆1,267 Updated 6 years ago