DavidGrangier / wikipedia-biography-dataset
This dataset gathers 728,321 biographies from Wikipedia. It is intended for evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).
☆156Updated 8 years ago
Alternatives and similar repositories for wikipedia-biography-dataset:
Users who are interested in wikipedia-biography-dataset are comparing it to the repositories listed below:
- Code for "Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation" (NAACL 2018)☆92Updated 6 years ago
- Code from the paper "Step-by-Step: Separating Planning from Realization in Neural Data-to-Text Generation" (NAACL 2019)☆127Updated last year
- ☆113Updated 2 years ago
- Code for "CGMH: Constrained Sentence Generation by Metropolis-Hastings Sampling"☆162Updated 5 years ago
- ☆156Updated 5 years ago
- ☆177Updated 6 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings☆99Updated 5 years ago
- A toolkit for evaluating the linguistic knowledge and transferability of contextual representations. Code for "Linguistic Knowledge and T…☆210Updated 3 years ago
- Code for "MojiTalk: Generating Emotional Responses at Scale" https://arxiv.org/abs/1711.04090☆121Updated 6 years ago
- ☆209Updated 4 years ago
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆154Updated 3 years ago
- ☆112Updated 5 years ago
- Code for "Controllable Paraphrase Generation with a Syntactic Exemplar" (ACL 2019)☆80Updated 5 years ago
- Text Simplification System and Dataset☆124Updated last year
- Official GitHub repo for the paper "Unifying Human and Statistical Evaluation for Natural Language Generation"☆73Updated 5 years ago
- NLP research experiments, built on PyTorch within the AllenNLP framework.☆91Updated 10 months ago
- Semantic summarization using Abstract Meaning Representation (AMR)☆74Updated 9 years ago
- Code for AAAI2018 paper "Table-to-text Generation by Structure-aware Seq2seq Learning"☆152Updated 2 years ago
- Implements the paper "Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models" by Serban et al. (current…☆116Updated 6 years ago
- NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation☆69Updated 11 months ago
- Python wrapper for evaluating summarization quality by ROUGE package☆164Updated 4 years ago
- Linguistically-Informed Self-Attention implemented in TensorFlow☆201Updated 5 years ago
- AMR Parsing as Sequence-to-Graph Transduction☆154Updated 6 months ago
- ☆138Updated 3 years ago
- ☆53Updated 4 years ago
- Code corresponding to our paper "A Graph-to-Sequence Model for AMR-to-Text Generation"☆138Updated 3 years ago
- CMU Document Grounded Conversation Dataset☆110Updated 6 years ago
- ProPara (Process Paragraph Comprehension) dataset and models☆81Updated 5 years ago
- Code for the ACL 2018 paper "Neural Document Summarization by Jointly Learning to Score and Select Sentences"☆150Updated 6 years ago
- ☆168Updated 6 years ago