Yale-LILY / SummerTime
An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo
☆273Updated last year
Alternatives and similar repositories for SummerTime:
Users that are interested in SummerTime are comparing it to the libraries listed below
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆428Updated last year
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆200Updated last year
- SummVis is an interactive visualization tool for text summarization.☆251Updated 2 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆146Updated last year
- Human-free quality estimation of document summaries☆95Updated 6 months ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- Neural Question Generation using the SQuAD and NewsQA datasets☆109Updated 2 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆299Updated 4 years ago
- Large-scale multi-document summarization dataset and code☆279Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆154Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆171Updated 2 years ago
- ☆186Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆173Updated last month
- NeuralQA: A Usable Library for Question Answering on Large Datasets with BERT☆231Updated last year
- Scripts and links to recreate the ELI5 dataset.☆320Updated 3 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆337Updated 2 months ago
- Question-answers, collected from Google☆125Updated 3 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆382Updated 7 months ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- Official code and data repository for our EMNLP 2020 long paper "Reformulating Unsupervised Style Transfer as Paraphrase Generation" (htt…☆233Updated 2 years ago
- A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.☆78Updated 3 years ago
- Text2Text Language Modeling Toolkit☆295Updated 2 weeks ago
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆555Updated 3 years ago
- New dataset☆300Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆159Updated 4 months ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated last year
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆117Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago