IBM / mt-rag-benchmarkLinks
Multi-Turn RAG Benchmark
☆119Updated last week
Alternatives and similar repositories for mt-rag-benchmark
Users that are interested in mt-rag-benchmark are comparing it to the libraries listed below
Sorting:
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆189Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆223Updated last year
- ☆138Updated 2 years ago
- ☆187Updated 7 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆189Updated 4 months ago
- RARR: Researching and Revising What Language Models Say, Using Language Models☆51Updated 2 years ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆62Updated 2 years ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)☆59Updated 4 months ago
- ☆43Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Updated last month
- Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering☆37Updated 2 years ago
- ☆51Updated last year
- Code implementation of synthetic continued pretraining☆148Updated last year
- ☆59Updated 2 months ago
- 🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode…☆54Updated 2 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆129Updated last year
- Token-level Reference-free Hallucination Detection☆98Updated 2 years ago
- The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]☆48Updated last year
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆83Updated 3 weeks ago
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆83Updated 3 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Updated last month
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆75Updated 3 years ago
- A toolkit for building dense retrievers with deep language models.☆64Updated 4 years ago
- Codes and packages for the paper titled Evaluating Retrieval Quality in Retrieval-Augmented Generation.☆30Updated 8 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆112Updated 2 years ago
- [NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering☆124Updated last year
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆75Updated last year
- A Survey of Attributions for Large Language Models☆222Updated 3 weeks ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆217Updated 7 months ago