lil-lab / newsroom
Tools for downloading and analyzing summaries and evaluating summarization systems. https://summari.es/
☆146 · Updated last year
Related projects
Alternatives and complementary repositories for newsroom
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf ☆153 · Updated 3 years ago
- ☆112 · Updated 4 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings ☆98 · Updated 4 years ago
- ☆139 · Updated 3 years ago
- Text Simplification System and Dataset ☆123 · Updated last year
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits. ☆124 · Updated 5 years ago
- Python wrapper for evaluating summarization quality by ROUGE package ☆166 · Updated 4 years ago
- Calculating ROUGE score between two files (line-by-line) ☆191 · Updated 3 years ago
- Full Python implementation of the ROUGE metric, producing the same results as the official Perl implementation. ☆157 · Updated 5 years ago
- Large corpus of uncompressed and compressed sentences from news articles. ☆123 · Updated 7 years ago
- NLP research experiments, built on PyTorch within the AllenNLP framework. ☆91 · Updated 8 months ago
- ☆111 · Updated 2 years ago
- Heuristic Analysis for NLI Systems ☆125 · Updated 3 years ago
- A Python wrapper for the ROUGE summarization evaluation package ☆251 · Updated 3 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference ☆61 · Updated last year
- Summarization datasets from the New York Times Annotated Corpus ☆47 · Updated 4 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o… ☆102 · Updated 11 months ago
- This dataset gathers 728,321 biographies from Wikipedia. It aims at evaluating text generation algorithms. For each article, we provide t… ☆157 · Updated 8 years ago
- Large scale sentential paraphrases collection and annotation ☆47 · Updated last year
- Code from the paper "Step-by-Step: Separating Planning from Realization in Neural Data-to-Text Generation" (NAACL 2019). ☆127 · Updated last year
- ☆229 · Updated 3 years ago
- ☆178 · Updated 5 years ago
- PyTorch source code of the NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models" ☆96 · Updated last year
- Pre-processing and in some cases downloading of datasets for the paper "Content Selection in Deep Learning Models of Summarization." ☆78 · Updated 2 years ago
- List of NLP (Natural Language Processing) Corpora. ☆64 · Updated 5 years ago
- Counter-fitting Word Vectors to Linguistic Constraints ☆144 · Updated 4 years ago
- This is the reference implementation of commonly used coreference metrics. ☆74 · Updated 6 years ago
- Unsupervised sentence summarization by contextual matching ☆47 · Updated 2 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning ☆60 · Updated last year
- Scientific Document Summarization Corpus and Annotations from the WING NUS group. ☆212 · Updated last year