mingdachen / SummScreen
SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)
☆34 Updated 2 years ago
Related projects:
- ☆28 Updated last year
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021). ☆28 Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues" ☆14 Updated last year
- ☆33 Updated last year
- ☆16 Updated last year
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022) ☆32 Updated 2 years ago
- Codes for ACL-IJCNLP 2021 Paper "Zero-shot Fact Verification by Claim Generation" ☆63 Updated 2 years ago
- ☆48 Updated last year
- ☆37 Updated last year
- ☆45 Updated last year
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts ☆51 Updated last year
- ☆41 Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark ☆51 Updated last year
- TBC ☆26 Updated last year
- The purpose of this repository is to introduce new dialogue-level commonsense inference datasets and tasks. We chose dialogues as the dat… ☆62 Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021) ☆27 Updated 2 years ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆56 Updated last year
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 Updated 2 years ago
- Code base of In-Context Learning for Dialogue State tracking ☆43 Updated 11 months ago
- ☆32 Updated last year
- ☆28 Updated 3 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045… ☆50 Updated last year
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆33 Updated last year
- Benchmark for evaluating open-ended generation ☆44 Updated last year
- A benchmark dataset for evaluating dialog system and natural language generation metrics. ☆33 Updated 2 years ago
- ☆26 Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr… ☆41 Updated last year
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆41 Updated last year
- ☆14 Updated last year
- Code for paper Document-Level Paraphrase Generation with Sentence Rewriting and Reordering by Zhe Lin, Yitao Cai and Xiaojun Wan. This pa… ☆24 Updated 2 years ago