iitmnlp / EvalEval
Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021
☆9Updated 3 years ago
Alternatives and similar repositories for EvalEval:
Users that are interested in EvalEval are comparing it to the libraries listed below
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆12Updated 2 years ago
- Code for our EMNLP 2019 paper titled "Sentence-Level Content Planning and Style Specification for Neural Text Generation"☆17Updated 4 years ago
- ☆16Updated 2 years ago
- The corresponding code from our paper " COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion (ACL …☆18Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆38Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆12Updated 2 years ago
- Code and data for paper "On the Robustness of Reading Comprehension Models to Entity Renaming" (NAACL'22)☆11Updated last year
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆36Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- ☆17Updated last year
- ☆21Updated 3 years ago
- Code for ModularQA☆28Updated 3 years ago
- ☆14Updated last year
- Consistent dialogue generation☆16Updated 2 years ago
- ☆15Updated 3 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Updated 2 years ago
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs".☆38Updated 9 months ago
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning☆11Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 7 months ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 3 years ago
- ☆48Updated 2 years ago
- Create augmentation examples from MultiNLI by subject-object inversion and passivizing.☆17Updated 4 years ago
- ☆29Updated 3 years ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆24Updated last year
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- ☆12Updated 5 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Updated 2 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆66Updated 3 years ago