MNoorFawi / lshashing
Python library to perform Locality-Sensitive Hashing for faster nearest-neighbor search in high-dimensional data
☆19 · Updated 11 months ago
Alternatives and similar repositories for lshashing
Users who are interested in lshashing are comparing it to the libraries listed below.
- A python package for running directed acyclic graphs of asynchronous I/O operations ☆17 · Updated 3 years ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity. ☆33 · Updated 4 years ago
- Python package for deduplication/entity resolution using active learning ☆81 · Updated 11 months ago
- Fast fuzzy text search ☆11 · Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆18 · Updated 3 years ago
- ☆24 · Updated 3 years ago
- Demo on how to use Prefect with Docker ☆27 · Updated 2 years ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning ☆51 · Updated 2 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast ☆16 · Updated 3 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I… ☆23 · Updated 3 years ago
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production… ☆29 · Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆83 · Updated 7 months ago
- Meadowflow is a proof-of-concept/prototype job scheduler built to explore the idea of implicit data dependency management. ☆10 · Updated 2 years ago
- A library to use `modal` as a backend for `joblib`. ☆29 · Updated 6 months ago
- Identify bias and measure fairness of your data ☆94 · Updated last week
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined. ☆12 · Updated last year
- Deep Learning how-to's using Lance file format ☆19 · Updated 2 months ago
- A variation on a standard Decision Tree such as that in sklearn, where nodes may be based on an aggregation of multiple splits. ☆9 · Updated last year
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f… ☆24 · Updated 4 years ago
- Cloud-agnostic Python API ☆60 · Updated last year
- Convert any image into a Region Adjacency Graph (RAG) ☆12 · Updated 5 years ago
- Efficient BM25 with DuckDB 🦆 ☆54 · Updated 7 months ago
- It's a cooler way to store simple linear models. ☆27 · Updated last year
- Record matching and entity resolution at scale in Spark ☆35 · Updated last year
- Have UV deal with all your Jupyter deps. ☆27 · Updated 11 months ago
- Exploring some issues related to churn ☆16 · Updated last year
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex … ☆79 · Updated last year
- Pipeline components that support partial_fit. ☆46 · Updated last year
- Public repository holding examples for dataheroes library ☆23 · Updated 2 months ago
- Versatile Metrics Collection for Python ☆19 · Updated 2 weeks ago