Alab-NII / 2wikimultihop
☆81Updated last year
Alternatives and similar repositories for 2wikimultihop:
Users that are interested in 2wikimultihop are comparing it to the libraries listed below
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022☆103Updated 7 months ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆66Updated 2 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆77Updated 2 months ago
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆57Updated last year
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)☆45Updated 3 months ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆79Updated last year
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆65Updated 5 months ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆82Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆99Updated 2 years ago
- ☆85Updated last year
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆58Updated last year
- ☆44Updated 9 months ago
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la…☆21Updated 2 months ago
- A comprehensive paper list of Reasoning over Tables.☆27Updated 2 years ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆68Updated last month
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆30Updated last year
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated last year
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆99Updated last year
- [NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering☆90Updated 9 months ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Updated last year
- ACL'23: Unified Demonstration Retriever for In-Context Learning☆34Updated last year
- Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…☆109Updated 2 years ago
- ☆28Updated 11 months ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- This is the code for the Submission 3358 at NeurIPS 2022.☆21Updated 2 years ago
- Token-level Reference-free Hallucination Detection☆93Updated last year
- ☆44Updated last year
- ☆81Updated last year
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆27Updated 10 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆63Updated 6 months ago