embeddings-benchmark/results

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/embeddings-benchmark/results)

embeddings-benchmark / results

Data for the MTEB leaderboard

☆57

Alternatives and similar repositories for results

Users that are interested in results are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

embeddings-benchmark / leaderboard
View on GitHub
Code for the MTEB leaderboard
☆32Feb 4, 2025Updated last year
illuin-tech / contextual-embeddings
View on GitHub
Model implementation for the contextual embeddings project
☆47Jun 2, 2025Updated last year
Mungeryang / colqwen3
View on GitHub
The code used to train and run inference with the ColQwen3 model. Welcome to follow and star! ⭐️⭐️⭐️ https://huggingface.co/goodman2001/…
☆15Jul 4, 2026Updated 2 weeks ago
AIR-Bench / AIR-Bench
View on GitHub
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆167Mar 29, 2026Updated 3 months ago
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TusKANNy / awesome-multivector-retrieval
View on GitHub
An extensive and commented list of resources on Late-Interaction Multivector Retrieval.
☆68Updated this week
openai / azure-cli
View on GitHub
Azure Command-Line Interface
☆14Mar 26, 2026Updated 3 months ago
hcompai / late-interaction-kernels
View on GitHub
Fused Triton kernels for late-interaction (MaxSim) scoring
☆23Jul 5, 2026Updated 2 weeks ago
isaacus-dev / text2markdown
View on GitHub
text2markdown is a Python library for intelligently converting plain text into Markdown.
☆19Jun 1, 2026Updated last month
UKPLab / eacl2024-lagonn
View on GitHub
Source code and data for Like a Good Nearest Neighbor
☆30Jan 12, 2025Updated last year
oceanumeric / EnteRAG
View on GitHub
A RAG that can scale 🧑🏻‍💻
☆11May 28, 2024Updated 2 years ago
lecs-lab / polygloss
View on GitHub
A massively multilingual corpus and pretrained model for IGT
☆15Jun 4, 2026Updated last month
Shkaolin / BERTopic-as-service
View on GitHub
Using BERTopic as a service to create easily interpretable topics
☆11Feb 6, 2023Updated 3 years ago
NJUDeepEngine / CAEF
View on GitHub
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Oct 11, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kwunhang / CatRAG
View on GitHub
CatRAG is a RAG framework builds on the HippoRAG 2 architecture and transforms the static KG into query-adaptive navigation structure. RA…
☆26Apr 14, 2026Updated 3 months ago
embeddings-benchmark / arena
View on GitHub
Code for the MTEB Arena
☆25Jul 2, 2025Updated last year
cwnu-airlab / NLTKor
View on GitHub
☆14Sep 29, 2025Updated 9 months ago
ottowg / gsap-ner
View on GitHub
☆10Oct 2, 2024Updated last year
jina-ai / jina-vdr
View on GitHub
Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval
☆38Aug 4, 2025Updated 11 months ago
hotchpotch / yasem
View on GitHub
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
☆13May 22, 2025Updated last year
wjbmattingly / ww2-spacy
View on GitHub
☆17Jan 5, 2023Updated 3 years ago
sonsuhyune / UPEval
View on GitHub
☆15Jul 14, 2026Updated last week
alvarobartt / vertex-ai-huggingface-inference-toolkit
View on GitHub
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17Mar 20, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
J-Seo / KoCommonGEN-V2
View on GitHub
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Aug 24, 2024Updated last year
cbecquart / ICSpyLab
View on GitHub
☆21Jun 15, 2026Updated last month
rasyosef / splade-index
View on GitHub
Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba
☆38Oct 16, 2025Updated 9 months ago
cltl / svm_wsd
View on GitHub
Word Sense Disambiguation system developed on the DutchSemCor project using Support Vector Machines. The input is plain text, and the out…
☆12Feb 5, 2019Updated 7 years ago
UT-SysML / rumors-in-multi-agent
View on GitHub
Code for AAAI Workshop WMAC "Paper Simulating Rumor Spreading in Social Networks using LLM agents"
☆13Feb 20, 2025Updated last year
docker / model-spec
View on GitHub
☆21Oct 2, 2025Updated 9 months ago
alvarobartt / opentrain
View on GitHub
🚂 Fine-tune OpenAI models for text classification, question answering, and more
☆17May 1, 2023Updated 3 years ago
Halvani / Alphabetic
View on GitHub
A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …
☆15Jul 19, 2024Updated 2 years ago
DerwenAI / pynock
View on GitHub
A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies
☆21Apr 27, 2026Updated 2 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
embeddings-benchmark / mteb
View on GitHub
MTEB: State-of-the-art evaluation of embeddings across languages and modalities
☆3,366Updated this week
kuk / simple-evals-ru
View on GitHub
Репозиторий измеряет качество Yandexgpt, Gigachat, T-Pro, Saiga, Vikhr, Ruadapt на популярных англоязычных бенчмарках: MGSM, MATH, HumanE…
☆25Apr 16, 2025Updated last year
Paul33333 / Agentic_RAG
View on GitHub
Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API
☆17Jun 21, 2025Updated last year
smpanaro / ModernBERT-AppleNeuralEngine
View on GitHub
ModernBERT model optimized for Apple Neural Engine.
☆38Jan 10, 2025Updated last year
icebaker / obsidian-nano-bots
View on GitHub
Nano Bots for Obsidian: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as…
☆15Jan 13, 2024Updated 2 years ago
bicici / FDA
View on GitHub
Feature Decay Algorithms
☆11Mar 5, 2014Updated 12 years ago
datarubrics / datarubrics
View on GitHub
DataRubrics, a structured framework for assessing the quality of both human- and model-generated datasets. Leveraging recent advances in …
☆17Jun 6, 2025Updated last year