Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.00456)
☆50 · Dec 8, 2022 · Updated 3 years ago
Alternatives and similar repositories for usr
Users who are interested in usr are comparing it to the repositories listed below.
- ☆23 · Dec 8, 2022 · Updated 3 years ago
- Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems ☆16 · Jun 8, 2021 · Updated 4 years ago
- Code for the paper "Learning an Unreferenced Metric for Online Dialogue Evaluation", ACL 2020 ☆28 · Jul 22, 2023 · Updated 2 years ago
- ☆63 · Oct 30, 2022 · Updated 3 years ago
- Code for SIGdial 2020 paper: Unsupervised Evaluation of Interactive Dialog with DialoGPT (https://arxiv.org/abs/2006.12719) ☆29 · Jun 8, 2020 · Updated 5 years ago
- GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems ☆56 · Dec 9, 2020 · Updated 5 years ago
- ☆15 · Nov 30, 2020 · Updated 5 years ago
- The Official Repository for the Automatic Dialogue Evaluation Sub-task of DSTC10 Track 5 (Automatic Evaluation and Moderation of Open-dom… ☆19 · Nov 1, 2021 · Updated 4 years ago
- ☆13 · Sep 20, 2020 · Updated 5 years ago
- Towards Quantifiable Dialogue Coherence Evaluation (ACL 2021) ☆58 · Oct 26, 2021 · Updated 4 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024) ☆14 · Oct 3, 2024 · Updated last year
- Dialogue Natural Language Inference with BERT classifier ☆21 · Dec 2, 2020 · Updated 5 years ago
- Code Repository For ACL2021 Paper - DynaEval: Unifying Turn and Dialogue Level Evaluation ☆13 · Sep 2, 2022 · Updated 3 years ago
- NLG and NLU for dialogue processing ☆41 · Jun 17, 2023 · Updated 2 years ago
- An unreferenced image captioning metric (ACL-21) ☆30 · Apr 28, 2024 · Updated last year
- ☆49 · Jun 12, 2023 · Updated 2 years ago
- ☆19 · Jun 7, 2021 · Updated 4 years ago
- Factual consistency checking model for abstractive summaries (NAACL-22 Findings) ☆30 · May 7, 2022 · Updated 3 years ago
- The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment" ☆74 · Sep 21, 2024 · Updated last year
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark ☆24 · Aug 13, 2022 · Updated 3 years ago
- ☆23 · Oct 23, 2025 · Updated 4 months ago
- Code, data, and additional analysis for the paper Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evalua… ☆15 · Aug 13, 2020 · Updated 5 years ago
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain … ☆19 · Dec 16, 2022 · Updated 3 years ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆48 · Jan 21, 2025 · Updated last year
- Portal Tutorial ☆11 · Feb 3, 2018 · Updated 8 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" ☆345 · Nov 11, 2024 · Updated last year
- "Target-Guided Open-Domain Conversation" in ACL 2019 ☆147 · Jun 6, 2019 · Updated 6 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation ☆97 · Mar 20, 2023 · Updated 3 years ago
- Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances". ☆94 · Jun 30, 2021 · Updated 4 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing ☆13 · Dec 6, 2022 · Updated 3 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so… ☆17 · Jul 27, 2023 · Updated 2 years ago
- Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering… ☆32 · Sep 23, 2018 · Updated 7 years ago
- Implementation for NATv2. ☆23 · Feb 20, 2021 · Updated 5 years ago
- ☆16 · Dec 10, 2022 · Updated 3 years ago
- Code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations ☆11 · Oct 13, 2023 · Updated 2 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020) ☆55 · Feb 24, 2021 · Updated 5 years ago
- Repository for the CODAH dataset ☆22 · Oct 29, 2022 · Updated 3 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence ☆36 · Jul 25, 2023 · Updated 2 years ago
- Code for "Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models (CoNLL 2018)" ☆15 · Feb 6, 2019 · Updated 7 years ago