thu-coai / OpenMEVA
Benchmark for evaluating open-ended generation
☆44Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for OpenMEVA
- ☆29Updated last year
- [COLING22] An End-to-End Library for Evaluating Natural Language Generation☆88Updated 11 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆61Updated 4 months ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆28Updated 2 years ago
- Codes for our paper "CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation" (ACL 2022)☆32Updated 2 years ago
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation.☆28Updated last month
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆95Updated last year
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆42Updated last year
- ☆90Updated 7 months ago
- The corresponding code from our paper " COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion (ACL …☆17Updated 2 years ago
- ☆58Updated 2 years ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆97Updated last year
- ☆48Updated last year
- ☆36Updated 7 months ago
- ☆60Updated last year
- Code for ACL 2020 paper: USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation (https://arxiv.org/pdf/2005.0045…☆50Updated last year
- ☆80Updated last year
- Repository for ACL 2022 paper Mix and Match: Learning-free Controllable Text Generation using Energy Language Models☆41Updated 2 years ago
- UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation☆57Updated 4 years ago
- TBC☆26Updated 2 years ago
- ☆27Updated 10 months ago
- ☆70Updated 3 years ago
- Detect hallucinated tokens for conditional sequence generation.☆63Updated 2 years ago
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Updated 2 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆62Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- ☆44Updated last year
- Codes for paper "Stylized Story Generation with Style-Guided Planning"☆13Updated 3 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering☆45Updated 2 years ago
- The Official Repository for the Automatic Dialogue Evaluation Sub-task of DSTC10 Track 5 (Automatic Evaluation and Moderation of Open-dom…☆19Updated 3 years ago