Efficient vector database for hundreds of millions of embeddings.
☆212 May 17, 2024 Updated last year
Alternatives and similar repositories for BinaryVectorDB
Users who are interested in BinaryVectorDB are comparing it to the libraries listed below.
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆21 May 18, 2025 Updated 10 months ago
- Supercharge huggingface transformers with model parallelism. ☆78 Jul 23, 2025 Updated 7 months ago
- Using short models to classify long texts ☆21 Mar 8, 2023 Updated 3 years ago
- utilities for loading and running text embeddings with onnx ☆45 Aug 16, 2025 Updated 7 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆159 Jul 14, 2025 Updated 8 months ago
- ☆119 Dec 18, 2024 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆35 Apr 17, 2025 Updated 11 months ago
- Rust implementation of Surya ☆66 Mar 1, 2025 Updated last year
- data cleaning and curation for unstructured text ☆329 Aug 6, 2024 Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆209 Aug 31, 2024 Updated last year
- ☆209 Jun 26, 2025 Updated 8 months ago
- Neural Search ☆366 Mar 11, 2025 Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆450 Feb 13, 2024 Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆59 Jan 5, 2026 Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆60 Apr 11, 2024 Updated last year
- Full text search that feels like a numpy array ☆304 Feb 1, 2026 Updated last month
- Fast lexical search implementing BM25 in Python ☆1,589 Updated this week
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆22 Jul 4, 2025 Updated 8 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 Mar 12, 2024 Updated 2 years ago
- Tools to make language models a bit easier to use ☆65 Mar 12, 2026 Updated last week
- Extract structured text from pdfs quickly ☆672 Jun 11, 2025 Updated 9 months ago
- Interactive coding assistant for data scientists and machine learning developers, empowered by large language models. ☆99 Oct 8, 2024 Updated last year
- Not financial advice. ☆28 Mar 18, 2023 Updated 3 years ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆15 Apr 15, 2024 Updated last year
- Cerule - A Tiny Mighty Vision Model ☆68 Nov 9, 2025 Updated 4 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,564 Mar 5, 2026 Updated 2 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,956 Updated this week
- Late Interaction Models Training & Retrieval ☆754 Mar 6, 2026 Updated 2 weeks ago
- Smart reproducible analytical pipeline inspection ☆21 Feb 13, 2026 Updated last month
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels ☆346 Dec 16, 2024 Updated last year
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,882 May 17, 2025 Updated 10 months ago
- lossily compress representation vectors using product quantization ☆59 Oct 28, 2025 Updated 4 months ago
- Tree-based indexes for neural-search ☆31 Mar 4, 2024 Updated 2 years ago
- ☆12 Feb 22, 2024 Updated 2 years ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆283 Mar 2, 2026 Updated 2 weeks ago
- Red-Teaming Language Models with DSPy ☆253 Feb 13, 2025 Updated last year
- A simple Python sandbox for helpful LLM data agents ☆307 Jun 18, 2024 Updated last year
- A cool AI Diagram generator from a given topic, that streams the partial diagrams from the incomplete JSONs during generation. Built usin… ☆218 Apr 25, 2024 Updated last year
- Rust Implementation of micrograd ☆52 Jul 3, 2024 Updated last year