MNoorFawi / lshashing
Python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data
☆19 Updated 9 months ago
Alternatives and similar repositories for lshashing
Users who are interested in lshashing are comparing it to the libraries listed below.
- A python package for running directed acyclic graphs of asynchronous I/O operations ☆16 Updated 3 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I… ☆23 Updated 2 years ago
- Public repository holding examples for dataheroes library ☆22 Updated last month
- ☆24 Updated 3 years ago
- Examples of vector DB indexing and query with various vector databases. ☆12 Updated 3 months ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast ☆16 Updated 3 years ago
- Have UV deal with all your Jupyter deps. ☆26 Updated 9 months ago
- ☆17 Updated last week
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 Updated 3 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f… ☆24 Updated 4 years ago
- MinHash implementation in Python ☆11 Updated 9 months ago
- Python package for deduplication/entity resolution using active learning ☆80 Updated 9 months ago
- Code for paper: "Privately generating tabular data using language models". ☆15 Updated last year
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow… ☆11 Updated 3 years ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity. ☆33 Updated 4 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆36 Updated last year
- Versatile Metrics Collection for Python ☆19 Updated last year
- Advances in Neural Information Processing Systems (NeurIPS 2021) ☆22 Updated 2 years ago
- A Python library for creating adversarial splits ☆13 Updated 2 years ago
- A utility for labeling clusters of text data. ☆28 Updated 3 years ago
- Create visualizations that are reproducible, easy to organize, and automatically detect if anything changes ☆17 Updated 6 months ago
- Caching and distributed locks in your applications with just one or two lines. Easy to learn. Fast to code. ☆35 Updated last month
- Text Processing & Segmentation Framework ☆22 Updated 2 months ago
- A simple and streamlined Python script to extract and filter links from a remote HTML resource. ☆24 Updated 4 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆26 Updated last year
- ChatBot App built using LangChain and Lightning AI ☆18 Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆17 Updated 3 years ago
- A streamlit component to embed Disqus in your applications. ☆10 Updated 4 years ago
- Core Utilities for NVIDIA Merlin ☆19 Updated 10 months ago
- SciKIt-learn Pipeline in PAndas ☆42 Updated last year