MNoorFawi / lshashingLinks
python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data
☆19Updated 10 months ago
Alternatives and similar repositories for lshashing
Users that are interested in lshashing are comparing it to the libraries listed below
Sorting:
- A python package for running directed acyclic graphs of asynchronous I/O operations☆16Updated 3 years ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.☆32Updated 4 years ago
- Init setup for github repo.☆18Updated last month
- Python package for deduplication/entity resolution using active learning☆81Updated 10 months ago
- Notebooks on using transformers for sequential recommendation tasks☆17Updated 2 years ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago
- ☆18Updated last month
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Updated 3 years ago
- ☆15Updated last month
- ☆24Updated 3 years ago
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14Updated 2 years ago
- a convenient way to anonymize your data for analytics☆22Updated 3 years ago
- A graph query engine☆17Updated 3 months ago
- MinHash implementation in Python☆11Updated 10 months ago
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 5 months ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Python wrapper for the Lago Rest API☆24Updated this week
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not…☆39Updated last year
- Cloud-agnostic Python API☆60Updated last year
- Ssebowa is free and open source library in Python that provides generative-ai models.☆14Updated last year
- ASTChunk is a Python toolkit for code chunking using Abstract Syntax Trees (ASTs), designed to create structurally sound and meaningful c…☆30Updated 2 weeks ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- scraping and querying documents for LLMs☆23Updated last month
- Caching and distributed locks in your applications with just one or two lines. Easy to learn. Fast to code.☆36Updated 2 months ago
- Efficient BM25 with DuckDB 🦆☆51Updated 6 months ago
- It's a cooler way to store simple linear models.☆27Updated 11 months ago
- Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.☆86Updated 7 months ago
- Prefect integrations for working with OpenAI.☆34Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 3 years ago
- Personalization with deep learning in 100 lines of code☆15Updated 2 years ago