MNoorFawi / lshashingLinks
python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data
☆19Updated last year
Alternatives and similar repositories for lshashing
Users that are interested in lshashing are comparing it to the libraries listed below
Sorting:
- A python package for running directed acyclic graphs of asynchronous I/O operations☆17Updated 4 years ago
- Demo on how to use Prefect with Docker☆27Updated 3 years ago
- ☆24Updated 4 years ago
- SPEAR: Programmatically label and build training data quickly.☆109Updated last year
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.☆34Updated 5 years ago
- Public repository holding examples for dataheroes library☆25Updated 8 months ago
- Personalized Purchase Prediction of Market Baskets with Wasserstein-Based Sequence Matching☆19Updated 6 years ago
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production…☆29Updated 2 years ago
- Wave Partial Differential Equation Solver in Python☆14Updated last year
- Loan Risk Prediction Neural Network and API☆17Updated 5 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 4 years ago
- Django plugin for online machine learning with river (under-development)☆15Updated 2 years ago
- Exploring the classical regression capabilities of LLMs.☆18Updated last year
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Updated 4 years ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- Versatile Metrics Collection for Python☆20Updated last month
- Automatic machine learning for tabular data. ⚡🔥⚡☆70Updated 4 years ago
- Have UV deal with all your Jupyter deps.☆28Updated last year
- Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.☆87Updated last year
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not…☆41Updated last year
- Fast fuzzy text search☆11Updated 2 years ago
- Notebooks on using transformers for sequential recommendation tasks☆17Updated 3 years ago
- It's a cooler way to store simple linear models.☆27Updated last year
- Deep Learning how-to's using Lance file format☆22Updated 8 months ago
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning☆53Updated 3 years ago
- A Modal that works with Panel in both server and notebook environments.☆21Updated 2 years ago
- Pipeline components that support partial_fit.☆46Updated last year
- Record matching and entity resolution at scale in Spark☆36Updated 2 years ago
- ☆31Updated 4 years ago
- FuturePool is a package that introduce known concept of multiprocessing Pool to the async/await world. It allows for easy translation fro…☆14Updated last year