mixedbread-ai/baguetter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mixedbread-ai/baguetter)

mixedbread-ai / baguetter

Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, implementing, and testing new search methods. Baguetter supports sparse (traditional), dense (semantic), and hybrid retrieval methods.

☆211

Alternatives and similar repositories for baguetter

Users that are interested in baguetter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mixedbread-ai / ofen
View on GitHub
WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included
☆17Oct 2, 2024Updated last year
mixedbread-ai / wiki_demo_app
View on GitHub
☆14Jun 25, 2024Updated 2 years ago
mixedbread-ai / binary-embeddings
View on GitHub
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…
☆19Mar 23, 2024Updated 2 years ago
mixedbread-ai / python-sdk
View on GitHub
mixedbread ai python sdk
☆12Jul 1, 2024Updated 2 years ago
mixedbread-ai / batched
View on GitHub
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161Jul 14, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆876Updated this week
xhluca / bm25s
View on GitHub
Fast BM25 search in Python, powered by Numpy and Numba
☆1,746Updated this week
mixedbread-ai / mxbai-rerank
View on GitHub
Crispy reranking models by Mixedbread
☆52Sep 17, 2025Updated 10 months ago
AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,626Dec 20, 2025Updated 7 months ago
lightonai / ducksearch
View on GitHub
Efficient BM25 with DuckDB 🦆
☆68Dec 20, 2024Updated last year
illuin-tech / modernvbert
View on GitHub
ModernVBERT is a 250M-parameter vision–language encoder that aligns a text-encoder (Ettin-150M) with a vision-encoder (SigLIP2-B) through…
☆16Oct 16, 2025Updated 9 months ago
lightonai / fast-plaid
View on GitHub
High-Performance Engine for Multi-Vector Search
☆271May 28, 2026Updated last month
xhluca / bm25-benchmarks
View on GitHub
☆24Jul 10, 2026Updated 2 weeks ago
AnswerDotAI / byaldi
View on GitHub
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
☆851Jan 28, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hscells / pybool_ir
View on GitHub
Toolkit for domain-specific information retrieval experimentation
☆19May 18, 2026Updated 2 months ago
jina-ai / late-chunking
View on GitHub
Code for explaining and evaluating late chunking (chunked pooling)
☆533Dec 23, 2024Updated last year
AmenRa / ranx
View on GitHub
⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
☆689Aug 7, 2025Updated 11 months ago
AnswerDotAI / RAGatouille
View on GitHub
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,943May 17, 2025Updated last year
pisa-engine / BMP
View on GitHub
Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.
☆37Jan 14, 2026Updated 6 months ago
webis-de / set-encoder
View on GitHub
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
☆19May 23, 2025Updated last year
ant-louis / xm-retrievers
View on GitHub
🌏 Modular retrievers for zero-shot multilingual IR.
☆30Mar 6, 2024Updated 2 years ago
enjalot / latent-data-modal
View on GitHub
Using modal.com to process FineWeb-edu data
☆20Apr 11, 2026Updated 3 months ago
lightonai / pylate-rs
View on GitHub
PyLate efficient inference engine
☆87Jan 7, 2026Updated 6 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
AmenRa / indxr
View on GitHub
A Python utility for indexing file lines. Best demo honourable mention at ECIR 2024.
☆23Nov 9, 2025Updated 8 months ago
raphaelsty / neural-cherche
View on GitHub
Neural Search
☆371Mar 11, 2025Updated last year
MinishLab / semhash
View on GitHub
Fast Multimodal Semantic Deduplication & Filtering
☆953May 24, 2026Updated 2 months ago
jxmorris12 / cde
View on GitHub
code for training & evaluating Contextual Document Embedding models
☆207May 14, 2025Updated last year
davidberenstein1957 / fast-sentence-transformers
View on GitHub
Simply, faster, sentence-transformers
☆144Aug 27, 2024Updated last year
tonywu71 / colpali-cookbooks
View on GitHub
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆357Jun 2, 2025Updated last year
shauli-ravfogel / descriptions
View on GitHub
☆10May 11, 2024Updated 2 years ago
AnswerDotAI / fastdata
View on GitHub
☆160Dec 2, 2024Updated last year
SeanLee97 / AnglE
View on GitHub
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
☆573Mar 22, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jxmorris12 / bm25_pt
View on GitHub
minimal pytorch implementation of bm25 (with sparse tensors)
☆105Oct 28, 2025Updated 8 months ago
hanxiao / flash-kmeans-mlx
View on GitHub
IO-aware batched K-Means for Apple Silicon, ported from Flash-KMeans (Triton/CUDA) to pure MLX. Up to 94x faster than sklearn.
☆17Mar 22, 2026Updated 4 months ago
querqy / querqy-unplugged
View on GitHub
☆16Jun 28, 2026Updated 3 weeks ago
illuin-tech / colpali
View on GitHub
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,707Jul 13, 2026Updated last week
DeployQL / LintDB
View on GitHub
Vector Database with support for late interaction and token level embeddings.
☆54Jun 20, 2025Updated last year
minimaxir / pokemon-embeddings
View on GitHub
Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.
☆20Jun 26, 2024Updated 2 years ago
sebastian-hofstaetter / colberter
View on GitHub
☆47Mar 27, 2022Updated 4 years ago