ekzhu / datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
☆2,765 Updated last year
Alternatives and similar repositories for datasketch
Users who are interested in datasketch are comparing it to the libraries listed below.
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me… ☆3,529 Updated 11 months ago
- A fast Python implementation of locality sensitive hashing. ☆667 Updated 5 years ago
- FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing) ☆1,153 Updated last year
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,417 Updated 2 months ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm ☆1,022 Updated 2 months ago
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive… ☆769 Updated 2 years ago
- A Python Implementation of Simhash Algorithm ☆1,027 Updated 3 years ago
- Learning embeddings for classification, retrieval and ranking. ☆3,956 Updated 2 years ago
- Header-only C++/python library for fast approximate nearest neighbors ☆4,875 Updated 2 months ago
- Approximate Nearest Neighbor Search for Sparse Data in Python! ☆919 Updated 4 years ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk ☆13,922 Updated last year
- Deep recommender models using PyTorch. ☆3,025 Updated 2 years ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ☆1,107 Updated last week
- A Python nearest neighbor descent for approximate nearest neighbors ☆942 Updated 9 months ago
- Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data ☆1,323 Updated last month
- A system for quickly generating training data with weak supervision ☆5,910 Updated last year
- A library implementing different string similarity and distance measures using Python. ☆1,016 Updated 2 years ago
- Library for factorization machines ☆1,493 Updated 5 years ago
- A python binding for crfsuite ☆772 Updated 11 months ago
- Fast implementation of the edit distance (Levenshtein distance) ☆687 Updated last year
- A Library for Field-aware Factorization Machines ☆1,604 Updated last year
- A python tool for evaluating the quality of sentence embeddings. ☆2,108 Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,853 Updated 3 weeks ago
- fastFM: A Library for Factorization Machines ☆1,086 Updated 3 years ago
- ☆1,241 Updated last year
- Fast Python Collaborative Filtering for Implicit Feedback Datasets ☆3,702 Updated last year
- A high performance implementation of HDBSCAN clustering. ☆2,979 Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆1,277 Updated 4 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,151 Updated last week
- Python library implementing a trie data structure. ☆824 Updated 4 years ago