iitmnlp / EvalEval
Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021
☆9Updated 3 years ago
Alternatives and similar repositories for EvalEval:
Users who are interested in EvalEval are comparing it to the repositories listed below
- Consistent dialogue generation☆16Updated 2 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆39Updated 2 years ago
- The corresponding code from our paper "COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion (ACL …☆18Updated 2 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset of crowd-annotated conversational claims, paired with pieces of evidence fr…☆41Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆12Updated 2 years ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆25Updated last year
- Code for our EMNLP 2019 paper titled "Sentence-Level Content Planning and Style Specification for Neural Text Generation"☆17Updated 5 years ago
- The source code of our ACL paper "A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance an…☆14Updated 2 years ago
- Code for ModularQA☆28Updated 3 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Updated 2 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆36Updated 2 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆12Updated 2 years ago
- EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization☆35Updated last year
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).☆28Updated 3 years ago
- Code for the paper "Simulating Bandit Learning from User Feedback for Extractive Question Answering".☆18Updated 2 years ago
- Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20)☆30Updated 4 years ago
- [NAACL'22-Findings] Dataset for "Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training"☆18Updated 2 years ago
- Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering"☆17Updated 3 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio…☆13Updated last year
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆13Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆67Updated 3 years ago