Shikib / usr
Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.00456)
☆50Updated last year
Related projects ⓘ
Alternatives and complementary repositories for usr
- ☆58Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- Code for SIGdial 2020 paper: Unsupervised Evaluation of Interactive Dialog with DialoGPT (https://arxiv.org/abs/2006.12719)☆28Updated 4 years ago
- ☆41Updated 3 years ago
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆60Updated last year
- ☆48Updated last year
- The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"☆74Updated 2 months ago
- Posterior Control of Blackbox Generation☆23Updated 4 years ago
- ☆42Updated 3 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆22Updated 3 years ago
- ☆61Updated 2 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering☆45Updated 2 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated last year
- Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".☆59Updated 4 years ago
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering☆70Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Updated last year
- NLG and NLU for dialogue processing☆43Updated last year
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- ☆53Updated last year
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆28Updated 2 years ago
- ☆60Updated last year
- ☆70Updated 3 years ago
- Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering (Kim et al., ACL 2021)☆31Updated last year
- FRANK: Factuality Evaluation Benchmark☆52Updated last year
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)☆71Updated 3 years ago
- Zero-shot dialogue state tracking (DST)☆82Updated 3 years ago
- [EMNLP 2020] Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading☆36Updated 2 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆58Updated 2 years ago
- ☆27Updated last year