thu-coai / OpenMEVA
Benchmark for evaluating open-ended generation
☆48Updated 6 months ago
Alternatives and similar repositories for OpenMEVA
Users that are interested in OpenMEVA are comparing it to the libraries listed below
Sorting:
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Updated 2 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)☆31Updated 2 years ago
- UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation☆58Updated 4 years ago
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated 2 years ago
- ☆90Updated last year
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆42Updated 3 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆61Updated 2 months ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering☆45Updated 2 years ago
- ☆32Updated last month
- The Official Repository for the Automatic Dialogue Evaluation Sub-task of DSTC10 Track 5 (Automatic Evaluation and Moderation of Open-dom…☆19Updated 3 years ago
- ☆62Updated 2 years ago
- ☆71Updated 3 years ago
- Code repository for our EMNLP 2020 long paper "Modeling Protagonist Emotions for Emotion-Aware Storytelling" (https://arxiv.org/abs/2010.…☆20Updated 4 years ago
- This project maintains a reading list for general text generation tasks☆65Updated 3 years ago
- This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluatio…☆77Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆71Updated 11 months ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆28Updated 3 years ago
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).☆27Updated 2 years ago
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆120Updated last year
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆96Updated 2 years ago
- Authors' implementation of the paper Adaptive Information Seeking for Open-Domain Question Answering, published in EMNLP 2021.☆37Updated 2 years ago
- ☆26Updated 2 years ago
- ☆48Updated 2 years ago
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆62Updated last year
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆96Updated 3 years ago
- Codes for our ACL21 paper: Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization☆94Updated 3 years ago
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"☆62Updated 2 years ago