nelson-liu / flatten_gigaword
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Updated 7 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below
Sorting:
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- ☆25Updated 3 years ago
- Question-Answer Meaning Representation☆48Updated 3 years ago
- ☆37Updated 6 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆43Updated 5 years ago
- Evaluating Text Representations on Lexical Composition☆24Updated 5 years ago
- Witwicky: An implementation of Transformer in PyTorch.☆22Updated 4 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 5 years ago
- Assessing syntactic abilities of BERT☆148Updated 5 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can used to evaluate how well models perform distinct types of reasoning…☆36Updated 4 years ago
- Code for the ACL 2018 paper "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"☆54Updated 7 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 6 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Various utility scripts useful for natural language processing, machine translation, etc.☆49Updated 2 years ago
- ☆16Updated 4 years ago
- Reproduction instructions for "Rapid Adaptation of Neural Machine Translation to New Languages"☆41Updated 6 years ago
- TER-plus Machine Translation metric.☆31Updated 2 years ago
- Ordinal Common-sense Inference☆27Updated 7 years ago
- ☆59Updated 7 years ago
- ☆44Updated 7 years ago
- ☆47Updated 7 years ago
- EMNLP DiscoEval paper☆43Updated 5 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆14Updated 5 years ago
- Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations"☆21Updated 2 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Updated 4 years ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies☆63Updated 2 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 3 years ago