ekzhu / datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
☆2,693Updated 11 months ago
Alternatives and similar repositories for datasketch
Users that are interested in datasketch are comparing it to the libraries listed below
Sorting:
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆3,483Updated 7 months ago
- FAst Lookups of Cosine and Other Nearest Neighbors (based on fast locality-sensitive hashing)☆1,151Updated 11 months ago
- A fast Python implementation of locality sensitive hashing.☆664Updated 5 years ago
- Benchmarks of approximate nearest neighbor libraries in Python☆5,266Updated 3 weeks ago
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,836Updated last year
- NLP, before and after spaCy☆2,225Updated last year
- Python module (C extension and plain python) implementing Aho-Corasick algorithm☆994Updated last year
- Header-only C++/python library for fast approximate nearest neighbors☆4,679Updated 3 weeks ago
- A Python Implementation of Simhash Algorithm☆1,009Updated 3 years ago
- Approximate Nearest Neighbor Search for Sparse Data in Python!☆919Updated 4 years ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,462Updated 3 weeks ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆13,728Updated 9 months ago
- Python framework for fast (approximated) nearest neighbour search in large, high-dimensional data sets using different locality-sensitive…☆767Updated 2 years ago
- A system for quickly generating training data with weak supervision☆5,856Updated last year
- Learning embeddings for classification, retrieval and ranking.☆3,953Updated 2 years ago
- A fast, efficient universal vector embedding utility package.☆1,647Updated last year
- ☆3,161Updated 3 years ago
- NLP made easy☆2,559Updated last year
- A Python nearest neighbor descent for approximate nearest neighbors☆920Updated 6 months ago
- Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data☆1,302Updated 2 weeks ago
- Learning to Rank in TensorFlow☆2,776Updated last year
- Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://…☆2,389Updated 3 years ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,848Updated last month
- Navigating Spreading-out Graph For Approximate Nearest Neighbor Search☆670Updated last year
- Open-source implementation of Google Vizier for hyper parameters tuning☆1,557Updated 5 years ago
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,721Updated 9 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,107Updated last year
- Generate embeddings from large-scale graph-structured data.☆3,412Updated last year
- Python library for interactive topic model visualization. Port of the R LDAvis package.☆1,831Updated 10 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,472Updated 3 weeks ago