dig-team / hanna-benchmark-asg
HANNA, a large annotated dataset of Human-ANnotated NArratives for Automatic Story Generation (ASG) evaluation.
☆25 Updated last month
Related projects:
- ☆28 Updated last year
- ☆80 Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆38 Updated 9 months ago
- Code base of In-Context Learning for Dialogue State Tracking ☆43 Updated 11 months ago
- First explanation metric (diagnostic report) for text generation evaluation ☆59 Updated 2 months ago
- The dataset and code for PeerSum at EMNLP'23. ☆14 Updated 9 months ago
- FRANK: Factuality Evaluation Benchmark ☆51 Updated last year
- Benchmark for evaluating open-ended generation ☆44 Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆56 Updated last year
- TBC ☆26 Updated last year
- ☆48 Updated last year
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction" ☆17 Updated last year
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering. ☆17 Updated last year
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆33 Updated last year
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023) ☆19 Updated 5 months ago
- Code for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022) ☆32 Updated 2 years ago
- ☆64 Updated 7 months ago
- ☆57 Updated 2 years ago
- ☆19 Updated 2 years ago
- ☆13 Updated 9 months ago
- Code and data for the FACTOR paper ☆36 Updated 10 months ago
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr… ☆41 Updated last year
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents" ☆60 Updated last year
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing" ☆52 Updated last year
- The code implementation of the EMNLP 2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene… ☆25 Updated 10 months ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021 ☆27 Updated 2 years ago
- ☆32 Updated last year
- ☆40 Updated this week
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation ☆17 Updated 5 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆41 Updated last year