Alab-NII / 2wikimultihop
☆96Updated last year
Alternatives and similar repositories for 2wikimultihop:
Users that are interested in 2wikimultihop are comparing it to the libraries listed below
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆121Updated 9 months ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)☆52Updated last month
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆94Updated last month
- Repository for Decomposed Prompting☆87Updated last year
- ☆86Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆101Updated 2 years ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆67Updated 8 months ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆59Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆67Updated 3 years ago
- ☆28Updated last year
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆57Updated last year
- ☆175Updated 2 years ago
- [ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…☆60Updated 8 months ago
- ☆45Updated 11 months ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆82Updated 2 weeks ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆26Updated last year
- https://acl2023-retrieval-lm.github.io/☆153Updated last year
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆83Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆67Updated 11 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated last month
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆51Updated last year
- A toolkit for building dense retrievers with deep language models.☆60Updated 3 years ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆59Updated last year
- Code and data for "The Power of Noise: Redefining Retrieval for RAG Systems"☆51Updated 5 months ago
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…☆22Updated 4 months ago
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models☆46Updated last year
- The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shen…☆119Updated last year
- An Open-Source Package for Information Retrieval☆162Updated last month