yale-nlp / ODSum
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆10Updated 6 months ago
Alternatives and similar repositories for ODSum:
Users that are interested in ODSum are comparing it to the libraries listed below
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"☆20Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆25Updated last year
- ☆38Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Updated 3 years ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Updated last year
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"☆41Updated 3 years ago
- ☆13Updated 2 years ago
- ☆34Updated 3 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆23Updated 6 months ago
- ☆17Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)☆27Updated 3 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Updated 2 years ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 3 weeks ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated 2 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021☆29Updated 2 years ago
- ☆15Updated 3 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- TBC☆26Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆58Updated last year
- ☆18Updated 11 months ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆42Updated 5 months ago
- Authors' implementation of the paper Adaptive Information Seeking for Open-Domain Question Answering, published in EMNLP 2021.☆37Updated last year
- ☆25Updated 2 years ago
- ☆24Updated 2 years ago
- Open-WikiTable :Dataset for Open Domain Question Answering with Complex Reasoning over Table☆23Updated last year
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆12Updated 2 years ago