nelson-liu / flatten_gigaword
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆24Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for flatten_gigaword
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 4 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆64Updated 7 years ago
- Various utility scripts useful for natural language processing, machine translation, etc.☆47Updated 2 years ago
- ☆36Updated 5 years ago
- Specialising Word Vectors for Lexical Entailment☆28Updated 6 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆64Updated last year
- Large scale sentential paraphrases collection and annotation☆47Updated last year
- ☆25Updated 2 years ago
- Assessing syntactic abilities of BERT☆150Updated 5 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆93Updated 6 years ago
- Evaluating Text Representations on Lexical Composition☆24Updated 5 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can used to evaluate how well models perform distinct types of reasoning…☆36Updated 3 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 4 years ago
- Code for the ACL 2018 paper "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"☆55Updated 6 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"☆77Updated 3 years ago
- The library that uses dependency parsing to preprocess text to train DisSent model☆33Updated 4 years ago
- Witwicky: An implementation of Transformer in PyTorch.☆22Updated 4 years ago
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only☆50Updated 3 months ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies☆61Updated last year
- Text generation with entities as context☆31Updated 6 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆44Updated 4 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 6 years ago
- ☆19Updated 4 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 4 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Updated 4 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆13Updated 4 years ago
- ☆15Updated 6 years ago
- Dynet-based Biaffine Parser☆33Updated 5 years ago
- OpenNMT based Neural Conversation model which implements Topic and Semantic Distributional Constraints to improve quality of generated re…☆27Updated 6 years ago
- NLI test set with lexical inferences☆48Updated 6 years ago