embeddings-benchmark / mtebpaper
Resources & scripts for the paper "MTEB: Massive Text Embedding Benchmark"
☆15 Updated last month
Related projects
Alternatives and complementary repositories for mtebpaper
- A framework for few-shot evaluation of autoregressive language models. ☆101 Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆122 Updated 7 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆78 Updated 3 months ago
- Repo for "On Learning to Summarize with Large Language Models as References" ☆42 Updated last year
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆30 Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆93 Updated last year
- ☆121 Updated 2 months ago
- Dense hybrid representations for text retrieval ☆61 Updated last year
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings ☆36 Updated 8 months ago
- Token-level Reference-free Hallucination Detection ☆92 Updated last year
- An extension of the Transformers library to include a T5ForSequenceClassification class. ☆37 Updated last year
- ACL 2023 - AlignScore, a metric for factual consistency evaluation. ☆110 Updated 8 months ago
- The LM Contamination Index is a manually created database of contamination evidence for LMs. ☆75 Updated 7 months ago
- Comprehensive benchmark for RAG ☆31 Updated last week
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022 ☆95 Updated 5 months ago
- ☆33 Updated last year
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation". ☆71 Updated 2 weeks ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆131 Updated 10 months ago
- Code and dataset for the EMNLP paper "Instruct and Extract: Instruction Tuning for On-Demand Information Extraction" ☆49 Updated 10 months ago
- Multilingual Large Language Models Evaluation Benchmark ☆105 Updated 2 months ago
- A toolkit for building dense retrievers with deep language models. ☆53 Updated 3 years ago
- Retrieval-Augmented Generation battle! ☆44 Updated last month
- ☆55 Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆58 Updated last year
- Code for the Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆65 Updated 8 months ago
- ☆63 Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆39 Updated 10 months ago
- ☆95 Updated last year
- GitHub repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆114 Updated last month
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆54 Updated last year