Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K most similar items for a large number of items by chunking the item matrix representation (embeddings) and using Numba to accelerate the calculations.
☆86Dec 28, 2024Updated last year
Alternatives and similar repositories for chunkdot
Users that are interested in chunkdot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FastAPI for Triton☆18Jun 11, 2022Updated 3 years ago
- Evals that meet you where you are. For AI that's grounded.☆55Feb 6, 2026Updated last month
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Aug 24, 2024Updated last year
- ☆12Oct 12, 2023Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple LLM-enabled document Q&A app built using Langchain and Streamlit☆10Dec 4, 2024Updated last year
- ☆14Aug 9, 2024Updated last year
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package☆55Jan 8, 2024Updated 2 years ago
- A file-backed dictionary for Python☆12Aug 15, 2022Updated 3 years ago
- ☆20Jan 27, 2024Updated 2 years ago
- The Journey of RAG: From Notebook to Microservices☆27Feb 22, 2024Updated 2 years ago
- All the tools that allow me to never ever open up Final Cut☆11Feb 16, 2025Updated last year
- AI Projects contains various projects which I have written about in my medium articles.☆55Aug 20, 2024Updated last year
- A Python implementation of the Thumbhash image placeholder generation algorithm.☆16Mar 12, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆18Apr 26, 2025Updated 10 months ago
- Browser automation for creating new pages in WordPress☆13Jun 7, 2025Updated 9 months ago
- pysh-db - The Data Science Toolkit (DSK)☆13Dec 5, 2018Updated 7 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- ☆16Jun 27, 2021Updated 4 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- This repo contains the example code used in my Medium article about NeuralProphet.☆15Dec 16, 2020Updated 5 years ago
- Neuro-Symbolic-Causal AI Agent — Project Chimera 🌌 | Open-source hybrid intelligence☆46Feb 20, 2026Updated last month
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Sep 10, 2023Updated 2 years ago
- A simple ReAct agent that has access to LlamaIndex docs and to the internet to provide you with insights on LlamaIndex itself.☆11Feb 23, 2025Updated last year
- ☆24Mar 9, 2016Updated 10 years ago
- CodebaseMD: A VS Code extension that converts codebases into structured Markdown documentation, optimized for LLMs and agentic coding too…☆15May 22, 2025Updated 10 months ago
- FinOps : Cost Optimization techniques in AWS Cloud. Cloud Cost Optimization☆13Apr 29, 2023Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- ☆10Nov 12, 2024Updated last year
- This is a Streamlit-based application designed to revolutionize the real estate search process with the power of AI. Utilizing Qdrant for…☆12Feb 14, 2024Updated 2 years ago
- ☆10Dec 3, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Locality Sensitive Hashing for semantic similarity (Python 3.x)☆15Jun 8, 2018Updated 7 years ago
- ☆25Mar 28, 2023Updated 2 years ago
- ☆10Mar 26, 2024Updated 2 years ago
- Interpretable feature construction from taxonomies for text classification☆18Apr 4, 2022Updated 3 years ago
- Use a local LLM to convert PDF to Markdown☆32Mar 10, 2025Updated last year
- ☆27Feb 11, 2026Updated last month
- ☆15Nov 4, 2024Updated last year