ekzhu / datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
☆2,657 Updated 9 months ago
Alternatives and similar repositories for datasketch:
Users who are interested in datasketch are comparing it to the libraries listed below.
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me… ☆3,454 Updated 5 months ago
- FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing) ☆1,150 Updated 9 months ago
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive… ☆767 Updated 2 years ago
- Benchmarks of approximate nearest neighbor libraries in Python ☆5,163 Updated 2 weeks ago
- A fast Python implementation of locality sensitive hashing. ☆661 Updated 4 years ago
- Header-only C++/python library for fast approximate nearest neighbors ☆4,588 Updated 7 months ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithm ☆981 Updated 11 months ago
- Learning to Rank in TensorFlow ☆2,767 Updated last year
- A Python Implementation of Simhash Algorithm ☆1,005 Updated 2 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents ☆284 Updated last year
- Approximate Nearest Neighbor Search for Sparse Data in Python! ☆918 Updated 4 years ago
- ☆3,156 Updated 3 years ago
- InferSent sentence embeddings ☆2,285 Updated 3 years ago
- Learning embeddings for classification, retrieval and ranking. ☆3,948 Updated 2 years ago
- A system for quickly generating training data with weak supervision ☆5,839 Updated 10 months ago
- Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data ☆1,289 Updated 2 weeks ago
- Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/ ☆1,252 Updated 3 years ago
- Library for factorization machines ☆1,489 Updated 4 years ago
- A python tool for evaluating the quality of sentence embeddings. ☆2,100 Updated last year
- Python library for interactive topic model visualization. Port of the R LDAvis package. ☆1,820 Updated 8 months ago
- General purpose unsupervised sentence representations ☆1,200 Updated 2 years ago
- Basic Utilities for PyTorch Natural Language Processing (NLP) ☆2,219 Updated last year
- NLP, before and after spaCy ☆2,215 Updated last year
- All-pair set similarity search on millions of sets in Python and on a laptop ☆592 Updated 2 years ago
- Python interface to Google word2vec ☆2,593 Updated last year
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ☆2,171 Updated 8 months ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,822 Updated last year
- A natural language modeling framework based on PyTorch ☆6,326 Updated 2 years ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ☆1,062 Updated 3 weeks ago
- Deep recommender models using PyTorch. ☆3,009 Updated 2 years ago