embeddings-benchmark / leaderboard
Code for the MTEB leaderboard
☆24Updated 2 months ago
Alternatives and similar repositories for leaderboard:
Users that are interested in leaderboard are comparing it to the libraries listed below
- ☆62Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆56Updated 7 months ago
- Pre-train Static Word Embeddings☆56Updated last week
- Crispy reranking models by Mixedbread☆22Updated last month
- ☆41Updated 2 months ago
- Query Expension for Better Query Embedding using LLMs☆47Updated 2 months ago
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated last month
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆42Updated 9 months ago
- ☆53Updated 10 months ago
- ☆32Updated this week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆68Updated last week
- ☆48Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- A RAG that can scale 🧑🏻💻☆11Updated 10 months ago
- Supercharge huggingface transformers with model parallelism.☆76Updated 6 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆45Updated last week
- minimal pytorch implementation of bm25 (with sparse tensors)☆100Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- An introduction to LLM Sampling☆77Updated 4 months ago
- Source code of our paper "PairDistill: Pairwise Relevance Distillation for Dense Retrieval", EMNLP 2024 Main.☆22Updated 4 months ago
- ☆47Updated 7 months ago
- A curated list of awesome papers about utilizing large language models for ranking.☆15Updated 5 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆65Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆63Updated 8 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆30Updated last month
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 3 months ago
- ☆42Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆62Updated 10 months ago