google / BEGIN-datasetLinks
A benchmark dataset for evaluating dialog system and natural language generation metrics.
☆39Updated 3 years ago
Alternatives and similar repositories for BEGIN-dataset
Users that are interested in BEGIN-dataset are comparing it to the libraries listed below
Sorting:
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆44Updated 3 years ago
- ☆50Updated 3 years ago
- Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020.☆29Updated last year
- Code for Aesop: Paraphrase Generation with Adaptive Syntactic Control (EMNLP 2021)☆26Updated 4 years ago
- FRANK: Factuality Evaluation Benchmark☆59Updated 3 years ago
- ☆58Updated 3 years ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation☆23Updated 3 years ago
- ☆20Updated 3 years ago
- ☆83Updated 2 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Updated 3 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated 2 years ago
- The official implementation for ACL 2021 "Challenges in Information Seeking QA: Unanswerable Questions and Paragraph Retrieval".☆28Updated 4 years ago
- Authors' implementation of the paper Adaptive Information Seeking for Open-Domain Question Answering, published in EMNLP 2021.☆38Updated 2 years ago
- ☆13Updated 3 years ago
- Code for EMNLP 2021 paper "CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization"☆46Updated 4 years ago
- ☆17Updated 2 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated 2 years ago
- Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances".☆94Updated 4 years ago
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14Updated 2 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆49Updated 3 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- ☆62Updated 3 years ago
- The implementation of the paper "Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters".☆17Updated 3 years ago
- ☆101Updated 3 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)☆33Updated 3 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 3 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering☆44Updated 3 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago
- ☆39Updated 2 years ago
- ☆30Updated 4 years ago