MNoorFawi / lshashing
Python library to perform Locality-Sensitive Hashing for faster nearest-neighbor search in high-dimensional data
☆19 · Updated 5 months ago
Alternatives and similar repositories for lshashing:
Users who are interested in lshashing are comparing it to the libraries listed below.
- MinHash implementation in Python ☆11 · Updated 4 months ago
- scraping and querying documents for LLMs ☆17 · Updated 3 weeks ago
- MirrorDataGenerator is a Python tool that generates synthetic data based on user-specified causal relations among features in the data. I… ☆21 · Updated 2 years ago
- A utility for labeling clusters of text data. ☆28 · Updated 3 years ago
- Notebooks on using transformers for sequential recommendation tasks ☆16 · Updated 2 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast ☆16 · Updated 3 years ago
- 2 lines of code to track ML experiments + EDA + check into GitHub ☆28 · Updated 2 years ago
- 🍞 Manipulate dynamic spreadsheets with arbitrary layouts using Python ☆11 · Updated 2 years ago
- Abstractions for feature engineering on large graphs of tabular data. ☆22 · Updated last week
- Efficient BM25 with DuckDB 🦆 ☆36 · Updated last month
- Clone of ChatGPT built with Bytewax, Streamlit and NATS ☆15 · Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆36 · Updated last year
- A variation on a standard Decision Tree such as that in sklearn, where nodes may be based on an aggregation of multiple splits. ☆10 · Updated 7 months ago
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity. ☆33 · Updated 3 years ago
- Examples of using Evidently to evaluate, test and monitor ML models. ☆18 · Updated last month
- A Python package for running directed acyclic graphs of asynchronous I/O operations ☆16 · Updated 3 years ago
- Distributed Task Queue based on Dask ☆36 · Updated last year
- Fast model deployment on AWS Lambda ☆14 · Updated 10 months ago
- efficient query encoding for dense retrieval ☆11 · Updated 5 months ago
- Have UV deal with all your Jupyter deps. ☆22 · Updated 4 months ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models. ☆25 · Updated 3 years ago
- Exploring the classical regression capabilities of LLMs. ☆18 · Updated 8 months ago
- Deep Learning how-to's using Lance file format ☆15 · Updated 4 months ago
- Datamallet is a Python library which contains several helper functions and modules for the common tasks in a typical data science workflow… ☆11 · Updated 2 years ago
- A Modal that works with Panel in both server and notebook environments. ☆20 · Updated last year
- Reading list for research topics in intent analysis. ☆15 · Updated last year
- Code for paper: "Privately generating tabular data using language models". ☆14 · Updated last year
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen… ☆18 · Updated last year
- A software engineering framework to jump start your machine learning projects ☆37 · Updated 7 months ago
- Personalization with deep learning in 100 lines of code ☆14 · Updated last year