nelson-liu / flatten_gigaword
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Updated 7 years ago
Alternatives and similar repositories for flatten_gigaword:
Users who are interested in flatten_gigaword are comparing it to the libraries listed below
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆92Updated 6 years ago
- ☆36Updated 5 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can be used to evaluate how well models perform distinct types of reasoning…☆36Updated 4 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆43Updated 4 years ago
- Python code for training models in the ACL paper, "Beyond BLEU: Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- Assessing syntactic abilities of BERT☆148Updated 5 years ago
- Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise"☆21Updated 5 years ago
- EMNLP DiscoEval paper☆42Updated 5 years ago
- Various utility scripts useful for natural language processing, machine translation, etc.☆48Updated 2 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 5 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 7 years ago
- ☆15Updated 7 years ago
- Tool to perform paired evaluation of automatic systems☆12Updated 3 years ago
- Code for the ACL 2018 paper "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"☆54Updated 6 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 4 years ago
- The library that uses dependency parsing to preprocess text to train DisSent model☆33Updated 4 years ago
- Evaluating Text Representations on Lexical Composition☆24Updated 5 years ago
- This repository contains the code from "Globally Coherent Text Generation with Neural Checklist Models" by Chloe Kiddon, Luke Zettlem…☆40Updated 3 years ago
- Companion site for "Analysis Methods in Neural Language Processing: A Survey"☆66Updated 4 years ago
- Text generation with entities as context☆30Updated 6 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018)☆81Updated 3 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 5 years ago
- Reasoning over Multiple Sentences (Multi-RC)☆33Updated 4 years ago
- ☆53Updated 4 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆14Updated 4 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆40Updated 6 years ago
- Question-Answer Meaning Representation☆48Updated 3 years ago