OriShapira / LitePyramids
Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.
☆12Updated 5 years ago
Alternatives and similar repositories for LitePyramids:
Users who are interested in LitePyramids are comparing it to the libraries listed below.
- ☆32Updated 3 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- ☆15Updated 3 years ago
- ☆10Updated 5 years ago
- Code and dataset for the EMNLP 2021 Findings paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- ☆20Updated 2 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆29Updated 5 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆79Updated 3 years ago
- ☆33Updated last year
- EMNLP DiscoEval paper☆43Updated 5 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- ☆46Updated 5 years ago
- ☆12Updated 5 years ago
- This is the repository for the Interspeech 2018 paper "Coherence models for dialogue".☆19Updated 5 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy☆14Updated 3 years ago
- ☆24Updated 10 months ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆34Updated last year
- ☆46Updated last year
- A retrieve and edit approach to generate sarcasm by reversing valence and adding incongruent common sense context☆32Updated 4 years ago
- This repository contains the script to compute the questions based on the Answerability aspect.☆38Updated 5 years ago
- ☆24Updated 2 years ago
- ☆27Updated 2 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated last year
- A coreference evaluation package for the CoNLL and ARRAU datasets☆40Updated 4 years ago
- Versatile Generative Language Model☆26Updated 2 years ago
- Codebase for probing and visualizing multilingual models.☆48Updated 4 years ago
- ☆17Updated 2 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago
- ☆39Updated 3 years ago