nelson-liu / flatten_gigaword
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Updated 7 years ago
Alternatives and similar repositories for flatten_gigaword:
Users that are interested in flatten_gigaword are comparing it to the libraries listed below
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- Assessing syntactic abilities of BERT☆148Updated 5 years ago
- Python code for training models in the ACL paper, "Beyond BLEU: Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can be used to evaluate how well models perform distinct types of reasoning…☆36Updated 4 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 7 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 5 years ago
- Evaluating Text Representations on Lexical Composition☆24Updated 5 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 6 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆43Updated 4 years ago
- ☆44Updated 7 years ago
- Code and data corresponding to "Hypothesis Only Baselines in Natural Language Inference" (StarSem 2018)☆25Updated 2 years ago
- Text generation with entities as context☆30Updated 6 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 4 years ago
- EMNLP DiscoEval paper☆42Updated 5 years ago
- ☆36Updated 5 years ago
- Question-Answer Meaning Representation☆48Updated 3 years ago
- Witwicky: An implementation of Transformer in PyTorch.☆22Updated 4 years ago
- ☆58Updated 6 years ago
- ☆27Updated 7 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Updated 3 years ago
- ☆38Updated 5 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources☆25Updated 5 years ago
- An implementation of semi-supervised VAE for morphology reinflection.☆26Updated 5 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 5 years ago
- Parser for Abstract Meaning Representation☆45Updated 4 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Updated 4 years ago
- ☆28Updated 9 months ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆14Updated 4 years ago