mixedbread-ai/maxsim-cpu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mixedbread-ai/maxsim-cpu)

mixedbread-ai / maxsim-cpu

☆57

Alternatives and similar repositories for maxsim-cpu

Users that are interested in maxsim-cpu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lightonai / pylate-rs
View on GitHub
PyLate efficient inference engine
☆87Jan 7, 2026Updated 6 months ago
TusKANNy / awesome-multivector-retrieval
View on GitHub
An extensive and commented list of resources on Late-Interaction Multivector Retrieval.
☆67Jul 8, 2026Updated last week
AnswerDotAI / fastkmeans
View on GitHub
☆101Jul 4, 2025Updated last year
lightonai / fast-plaid
View on GitHub
High-Performance Engine for Multi-Vector Search
☆268May 28, 2026Updated last month
searchivarius / py_mtasklite
View on GitHub
A missing piece of the Python multitask (both threads and processes) API: An extension that supports stateful worker pools & size-aware i…
☆29Mar 8, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mixedbread-ai / mxbai-rerank
View on GitHub
Crispy reranking models by Mixedbread
☆52Sep 17, 2025Updated 10 months ago
JHU-CLSP / ettin-encoder-vs-decoder
View on GitHub
State-of-the-art paired encoder and decoder models (17M-1B params)
☆74Aug 6, 2025Updated 11 months ago
ejaasaari / lemur
View on GitHub
[ICML'26] LEMUR reduces multi-vector retrieval for late interaction models such as ColBERT into regular single-vector retrieval.
☆31Jun 21, 2026Updated last month
TusKANNy / tachiom
View on GitHub
Official repository of TACHIOM.
☆62Updated this week
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆875Jul 13, 2026Updated last week
hseb-benchmark / hseb
View on GitHub
HSEB: Hybrid Search Engine Benchmark
☆21Oct 5, 2025Updated 9 months ago
raphaelsty / LeNLP
View on GitHub
NLP with Rust for Python 🦀🐍
☆72Jun 9, 2026Updated last month
jlscheerer / xtr-warp
View on GitHub
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆209May 3, 2025Updated last year
DBGroup-SUSTech / multi-vector-retrieval
View on GitHub
☆15Apr 19, 2026Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
sigmod26gem / sigmod26gem
View on GitHub
☆17Mar 13, 2026Updated 4 months ago
jfkback / hypencoder-paper
View on GitHub
Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"
☆40Sep 20, 2025Updated 10 months ago
recombee / CompresSAE
View on GitHub
Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
☆39Nov 21, 2025Updated 7 months ago
mixedbread-ai / batched
View on GitHub
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161Jul 14, 2025Updated last year
TusKANNy / awesome-learned-sparse-retrieval
View on GitHub
An extensive and commented list of resources on Learned Sparse Retrieval.
☆63Updated this week
illuin-tech / contextual-embeddings
View on GitHub
Model implementation for the contextual embeddings project
☆47Jun 2, 2025Updated last year
flairNLP / familiarity
View on GitHub
Label shift estimation for transfer difficulty with Familiarity.
☆10Feb 4, 2025Updated last year
lightonai / next-plaid
View on GitHub
NextPlaid, ColGREP: Multi-vector search, from database to coding agents.
☆516Jul 8, 2026Updated last week
fresh-stack / freshstack
View on GitHub
This repository helps you evaluate your models on the FreshStack benchmark!
☆34Dec 9, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
viig99 / muvfde
View on GitHub
Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.
☆20Jun 28, 2025Updated last year
stefan-it / modern-bert-ner
View on GitHub
My NER Experiments with ModernBERT and Ettin
☆29Jul 17, 2025Updated last year
embeddings-benchmark / arena
View on GitHub
Code for the MTEB Arena
☆25Jul 2, 2025Updated last year
mixedbread-ai / ofen
View on GitHub
WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included
☆17Oct 2, 2024Updated last year
hotchpotch / JQaRA
View on GitHub
JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット
☆44Sep 9, 2025Updated 10 months ago
TusKANNy / seismic
View on GitHub
Official repository of the Seismic library.
☆135Jul 6, 2026Updated 2 weeks ago
lightonai / bm25x
View on GitHub
A fast, streaming-friendly BM25 search engine in Rust with mmap support
☆54Mar 19, 2026Updated 4 months ago
flipz357 / S3BERT
View on GitHub
Learning Semantically Structured Text and Sentence Embeddings
☆71Mar 9, 2026Updated 4 months ago
mixedbread-ai / binary-embeddings
View on GitHub
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…
☆19Mar 23, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
thakur-nandan / sprint
View on GitHub
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
☆48Jul 25, 2023Updated 2 years ago
Muhtasham / llm-inference-simulator
View on GitHub
🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.
☆14Jul 12, 2025Updated last year
keunwoochoi / tokenizer-vs-tokenizer
View on GitHub
☆14Oct 18, 2023Updated 2 years ago
AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,624Dec 20, 2025Updated 7 months ago
orionw / promptriever
View on GitHub
The first dense retrieval model that can be prompted like an LM
☆93May 8, 2025Updated last year
yuhuifishash / NGFix
View on GitHub
[SIGMOD'26] Dynamically Detect and Fix Hardness for Efficient Approximate Nearest Neighbor Search
☆19Nov 9, 2025Updated 8 months ago
facebookresearch / sira
View on GitHub
Superintelligent Retrieval Agent (SIRA)
☆134Jun 4, 2026Updated last month