MNoorFawi / lshashing
python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data
β19Updated 6 months ago
Alternatives and similar repositories for lshashing:
Users that are interested in lshashing are comparing it to the libraries listed below
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iβ¦β21Updated 2 years ago
- π Manipulate dynamic spreadsheets with arbitrary layouts using Pythonβ11Updated 2 years ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systemsβ10Updated last year
- Code for paper: "Privately generating tabular data using language models".β14Updated last year
- Python wrapper for the Lago Rest APIβ21Updated last week
- Notebooks on using transformers for sequential recommendation tasksβ16Updated 2 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fastβ16Updated 3 years ago
- Exploring the classical regression capabilities of LLMs.β19Updated 8 months ago
- A variation on a standard Decision Tree such as that in sklearn, where nodes may be based on an aggregation of multiple splits.β10Updated 8 months ago
- Have UV deal with all your Jupyter deps.β22Updated 5 months ago
- Efficient BM25 with DuckDB π¦β39Updated last month
- A Python library to perform NER on structured data and generate PII with Fakerβ29Updated 8 months ago
- β13Updated 7 months ago
- It's a cooler way to store simple linear models.β28Updated 7 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ22Updated last year
- Record matching and entity resolution at scale in Sparkβ34Updated last year
- Text Processing & Segmentation Frameworkβ20Updated 2 weeks ago
- LLM-Powered Analyses of your GitHub Community usingΒ EvaDBβ24Updated last year
- A tutorial on locality sensitive hashing, using MinHashing for document similarity and CosineSimilarity for Euclidean space similarity.β33Updated 4 years ago
- Deep Learning how-to's using Lance file formatβ15Updated 4 months ago
- Scientific framework for representation in sequential dataβ11Updated 2 years ago
- Simple Workflow Framework - Hamilton + APScheduler = FlowerPowerβ15Updated this week
- Public repository holding examples for dataheroes libraryβ22Updated 2 months ago
- Demo of using ChatGPT API for language learningβ12Updated last year
- This is the source code of the paper "Inferring Complementary Products from Baskets and Browsing Sessions"β11Updated 6 years ago
- Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to productionβ¦β29Updated last year
- Retrieval Augmented Generation applicationsβ26Updated last year
- Examples of vector DB indexing and query with various vector databases.β12Updated this week
- A library to use `modal` as a backend for `joblib`.β26Updated last month
- β12Updated last month