embeddings-benchmark / mtebLinks

MTEB: Massive Text Embedding Benchmark

☆2,753

Alternatives and similar repositories for mteb

Users that are interested in mteb are comparing it to the libraries listed below

Sorting:

AkariAsai / self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,154Updated last year
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,833Updated last week
beir-cellar / beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆1,908Updated 2 months ago
McGill-NLP / llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
☆1,565Updated 6 months ago
parthsarthi03 / raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
☆1,339Updated 11 months ago
IntelLabs / fastRAG
Efficient Retrieval Augmentation and Generation Framework
☆1,625Updated 6 months ago
xlang-ai / instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
☆1,992Updated 6 months ago
stanford-futuredata / ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,525Updated last week
gkamradt / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆1,956Updated 11 months ago
huggingface / datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆2,524Updated last week
xhluca / bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
☆1,267Updated 2 months ago
Tongji-KGLLM / RAG-Survey
☆2,056Updated last year
stanford-crfm / helm
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models …
☆2,389Updated this week
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505Updated 2 months ago
huggingface / text-embeddings-inference
A blazing fast inference solution for text embeddings models
☆3,857Updated 2 weeks ago
noamgat / lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
☆1,861Updated 5 months ago
microsoft / LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,082Updated last month
tatsu-lab / alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆1,825Updated 7 months ago
AnswerDotAI / ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
☆1,473Updated last month
stanford-futuredata / ARES
Automated Evaluation of RAG Systems
☆637Updated 4 months ago
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆1,793Updated this week
hendrycks / test
Measuring Massive Multitask Language Understanding | ICLR 2021
☆1,464Updated 2 years ago
philschmid / deep-learning-pytorch-huggingface
☆1,256Updated 5 months ago
THUDM / AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆2,713Updated 6 months ago
explodinggradients / ragas
Supercharge Your LLM Application Evaluations 🚀
☆10,164Updated last week
bigscience-workshop / promptsource
Toolkit for creating, sharing and using natural language prompts.
☆2,917Updated last year
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,553Updated last year
stanfordnlp / pyreft
Stanford NLP Python library for Representation Finetuning (ReFT)
☆1,503Updated 6 months ago
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,299Updated last week
FlagOpen / FlagEmbedding
Retrieval and Retrieval-augmented LLMs
☆10,302Updated 3 weeks ago