Crispy reranking models by Mixedbread
☆51Sep 17, 2025Updated 7 months ago
Alternatives and similar repositories for mxbai-rerank
Users that are interested in mxbai-rerank are comparing it to the libraries listed below.
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- ☆14Jun 25, 2024Updated last year
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆19Mar 23, 2024Updated 2 years ago
- Implementation of ModernBERT in MLX☆21Jan 7, 2026Updated 4 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 9 months ago
- Better Live Text for MacOS☆35Feb 8, 2026Updated 2 months ago
- mixedbread ai python sdk☆12Jul 1, 2024Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆210Aug 31, 2024Updated last year
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆15Apr 6, 2026Updated last month
- ☆57Jul 10, 2025Updated 9 months ago
- Late Interaction Models Training & Retrieval☆796Updated this week
- NLP with Rust for Python 🦀🐍☆73May 13, 2025Updated 11 months ago
- My NER Experiments with ModernBERT and Ettin☆27Jul 17, 2025Updated 9 months ago
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 7 months ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Jul 7, 2023Updated 2 years ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 4 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆129Dec 29, 2025Updated 4 months ago
- ☆20Jan 3, 2025Updated last year
- ☆12Feb 22, 2024Updated 2 years ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Sep 21, 2023Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆65Updated this week
- Efficient BM25 with DuckDB 🦆☆67Dec 20, 2024Updated last year
- Rust crate for submitting inference requests to machine learning models☆15May 24, 2024Updated last year
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆29Nov 21, 2025Updated 5 months ago
- High-Performance Engine for Multi-Vector Search☆249Apr 22, 2026Updated 2 weeks ago
- Interface for interacting with Gradient AI in Python☆15Jun 28, 2024Updated last year
- Bringing BERT into modernity via both architecture changes and scaling☆1,668Mar 1, 2026Updated 2 months ago
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- Nearly Inference Free Embeddings: make your RAG queries 500x faster☆77Apr 27, 2026Updated last week
- Open Letter to University Leaders☆19Apr 6, 2020Updated 6 years ago
- Multi-Agent Reinforcement Learning Environment for the card game SkyJo, compatible with PettingZoo and RLLIB☆16Feb 21, 2026Updated 2 months ago
- An Efficient BPE Algorithm Faster than Hugging Face Tokenizer's Implementation☆13Sep 9, 2024Updated last year
- JQaRA: Japanese Question Answering with Retrieval Augmentation - a Japanese Q&A dataset for evaluating retrieval augmentation (RAG)☆44Sep 9, 2025Updated 7 months ago
- ☆10Jun 29, 2021Updated 4 years ago
- Word segmentation of Chinese corpora using SentencePiece☆13Nov 30, 2023Updated 2 years ago
- Benchmark for Japanese document embedding & vector search☆29Mar 12, 2024Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Jul 30, 2024Updated last year
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models☆17May 21, 2025Updated 11 months ago