MNoorFawi / lshashing
python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data
☆19Updated 9 months ago
Alternatives and similar repositories for lshashing
Users that are interested in lshashing are comparing it to the libraries listed below
Sorting:
- ☆15Updated 4 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 2 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Updated 3 years ago
- MinHash implementation in Python☆11Updated 8 months ago
- ☆24Updated 3 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fast☆16Updated 3 years ago
- The repository provides code for the paper RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders, CIKM'24☆11Updated 6 months ago
- Efficient BM25 with DuckDB 🦆☆48Updated 4 months ago
- ☆30Updated 3 years ago
- LLM application tracing based on OpenTelemetry☆10Updated 2 months ago
- Code for paper: "Privately generating tabular data using language models".☆15Updated last year
- Notebooks on using transformers for sequential recommendation tasks☆16Updated 2 years ago
- Python package for extractive NLP using the OpenAI API☆17Updated 8 months ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Updated 2 years ago
- Experiments to assess SPADE on different LLM pipelines.☆16Updated last year
- It's a cooler way to store simple linear models.☆28Updated 10 months ago
- A python package for running directed acyclic graphs of asynchronous I/O operations☆16Updated 3 years ago
- Convert any image into a Region Adjacency Graph (RAG)☆12Updated 5 years ago
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18Updated last year
- Feste is a free and open-source framework allowing scalable composition of NLP tasks using a graph execution model that is optimized and …☆41Updated 2 years ago
- efficient query encoding for dense retrieval☆11Updated 9 months ago
- Hyperparameter tuning via uncertainty modeling☆47Updated last year
- ☆22Updated 3 years ago
- Python wrapper for the Lago Rest API☆24Updated this week
- Helpers for scikit learn☆16Updated 2 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- A library to use `modal` as a backend for `joblib`.☆28Updated 4 months ago
- Public repository holding examples for dataheroes library☆22Updated 2 weeks ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago