nelson-liu / flatten_gigawordLinks
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Updated 8 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below
Sorting:
- Evaluating Text Representations on Lexical Composition☆24Updated 5 years ago
- Various utility scripts useful for natural language processing, machine translation, etc.☆49Updated 3 years ago
- Text Simplification System and Dataset☆124Updated 2 years ago
- ☆15Updated 7 years ago
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆155Updated 3 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 7 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆204Updated 2 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆97Updated 5 years ago
- ☆230Updated 4 years ago
- Heuristic Analysis for NLI Systems☆127Updated 4 years ago
- Appraise evaluation system for manual evaluation of machine translation output☆77Updated 4 years ago
- ☆33Updated last month
- This is the Grammarly's Yahoo Answers Formality Corpus☆107Updated 3 months ago
- ☆141Updated 4 years ago
- ☆59Updated 2 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings☆99Updated 5 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Updated 5 years ago
- Official code of our NAACL 2019 paper on Zero-Shot Cross-Lingual Transfer with Order Differences☆18Updated 6 years ago
- This is the reference implementation of commonly used coreference metrics.☆75Updated 7 years ago
- Fast Discourse Parser to find latent Rhetorical STructure (RST) in text.☆126Updated last year
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- Assessing syntactic abilities of BERT☆149Updated 6 years ago
- Tools for downloading and analyzing summaries and evaluating summarization systems. https://summari.es/☆150Updated 2 years ago
- Data and code used in the 2015 ACL paper, "Ground Truth for Grammatical Error Correction Metrics"☆53Updated 7 years ago
- ☆27Updated 7 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 8 years ago
- Diagnostic tests for linguistic capacities in language models☆65Updated 3 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can used to evaluate how well models perform distinct types of reasoning…☆36Updated 4 years ago