mixedbread-ai/mxbai-rerank

mixedbread-ai / mxbai-rerank

Crispy reranking models by Mixedbread

☆52

Alternatives and similar repositories for mxbai-rerank

Users interested in mxbai-rerank often compare it to the libraries listed below.

mixedbread-ai / maxsim-cpu
☆57 · Jul 10, 2025 · Updated last year
mixedbread-ai / wiki_demo_app
☆14 · Jun 25, 2024 · Updated 2 years ago
mixedbread-ai / binary-embeddings
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…
☆19 · Mar 23, 2024 · Updated 2 years ago
pappitti / modernbert-mlx
Implementation of ModernBERT in MLX
☆21 · Jan 7, 2026 · Updated 6 months ago
mixedbread-ai / batched
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161 · Jul 14, 2025 · Updated last year
TusKANNy / tachiom
Official repository of TACHIOM.
☆62 · Jul 17, 2026 · Updated last week
mixedbread-ai / python-sdk
mixedbread ai python sdk
☆12 · Jul 1, 2024 · Updated 2 years ago
JHU-CLSP / ettin-encoder-vs-decoder
State-of-the-art paired encoder and decoder models (17M-1B params)
☆76 · Aug 6, 2025 · Updated 11 months ago
AnswerDotAI / msglm
msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.
☆15 · Apr 6, 2026 · Updated 3 months ago
mixedbread-ai / baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆211 · Aug 31, 2024 · Updated last year
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆72 · Jun 9, 2026 · Updated last month
lightonai / pylate
Late Interaction Models Training & Retrieval
☆876 · Updated this week
lightonai / pylate-rs
PyLate efficient inference engine
☆87 · Jan 7, 2026 · Updated 6 months ago
stefan-it / modern-bert-ner
My NER Experiments with ModernBERT and Ettin
☆29 · Jul 17, 2025 · Updated last year
AnswerDotAI / fastkmeans
☆102 · Jul 4, 2025 · Updated last year
DunZhang / Jasper-Token-Compression-Training
The training codes of Jasper-Token-Compression-600M
☆20 · Nov 19, 2025 · Updated 8 months ago
illuin-tech / contextual-embeddings
Model implementation for the contextual embeddings project
☆47 · Jun 2, 2025 · Updated last year
embeddings-benchmark / arena
Code for the MTEB Arena
☆25 · Jul 2, 2025 · Updated last year
searchivarius / py_mtasklite
A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…
☆29 · Mar 8, 2026 · Updated 4 months ago
stephantul / pynife
Nearly Inference Free Embeddings: make your RAG queries 500x faster
☆80 · Apr 27, 2026 · Updated 2 months ago
lightonai / fast-plaid
High-Performance Engine for Multi-Vector Search
☆271 · May 28, 2026 · Updated last month
Knowledgator / GLiClass.c
C inference engine for running GLiClass (Generalist and Lightweight Classification) models
☆17 · May 21, 2025 · Updated last year
staghado / better-live-text
Better Live Text for MacOS
☆36 · Feb 8, 2026 · Updated 5 months ago
alessandrobenigni / BM25-Turbo-Rust-Python-WASM-CLI
The fastest BM25 scoring engine: 2,300x faster than BM25S. 28K QPS on 8.8M docs. 5 BM25 variants (Robertson, Lucene, ATIRE, BM25L, BM25+…
☆46 · Apr 3, 2026 · Updated 3 months ago
stephantul / skeletoken
Datamodels for hugging face tokenizers
☆109 · Jun 18, 2026 · Updated last month
speechmatics / ctranslate2_triton_backend
Triton backend for https://github.com/OpenNMT/CTranslate2
☆35 · Jul 7, 2023 · Updated 3 years ago
iliaschalkidis / flash-roberta
Hugging Face RoBERTa with Flash Attention 2
☆24 · Sep 14, 2025 · Updated 10 months ago
fresh-stack / freshstack
This repository helps you evaluate your models on the FreshStack benchmark!
☆34 · Dec 9, 2025 · Updated 7 months ago
NewBornRustacean / muvera-rs
unofficial implementation of MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings
☆15 · Feb 18, 2026 · Updated 5 months ago
penfever / wildchat-50m
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆39 · Apr 1, 2025 · Updated last year
beartype / pytest-beartype
Pytest plugin type-checking tests, fixtures, and/or your codebase with @beartype.
☆25 · Updated this week
kuzudb / dspy-kuzu-demo
Intro to using DSPy with Kuzu to enrich the data within the Nobel Laureate mentorship network
☆16 · Sep 16, 2025 · Updated 10 months ago
datologyai / luxical
☆81 · Dec 12, 2025 · Updated 7 months ago
huggingface / candle-cublaslt
☆13 · Feb 22, 2024 · Updated 2 years ago
cognica-io / bayesian-bm25
Bayesian probability transforms for BM25 retrieval scores
☆77 · Jun 20, 2026 · Updated last month
AnswerDotAI / ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
☆1,704 · Mar 1, 2026 · Updated 4 months ago
o19s / ubi
User Behavior Insights standard schema specification
☆47 · Oct 8, 2025 · Updated 9 months ago
michaelfeil / candle-flash-attn-v3
☆15 · Dec 21, 2025 · Updated 7 months ago
mobarski / aidapter
Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)
☆20 · Sep 21, 2023 · Updated 2 years ago