nelson-liu / flatten_gigaword
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Updated 7 years ago
Alternatives and similar repositories for flatten_gigaword:
Users that are interested in flatten_gigaword are comparing it to the libraries listed below
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆43Updated 4 years ago
- ☆37Updated 5 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 2 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- Various utility scripts useful for natural language processing, machine translation, etc.☆48Updated 2 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆91Updated 6 years ago
- Evaluating Text Representations on Lexical Composition☆24Updated 5 years ago
- ☆25Updated 3 years ago
- Diverse Natural Language Inference Collection - NLI dataset that can used to evaluate how well models perform distinct types of reasoning…☆36Updated 4 years ago
- Code for the ACL 2018 paper "Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context"☆54Updated 6 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 7 years ago
- ☆59Updated 6 years ago
- Code and data corresponding to "Hypothesis Only Baselines in Natural Language Inference" (StarSem 2018)☆25Updated 2 years ago
- ☆44Updated 7 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 5 years ago
- ☆31Updated 2 months ago
- EMNLP DiscoEval paper☆43Updated 5 years ago
- Summarization datasets from the New York Times Annotated Corpus☆47Updated 4 years ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Updated 5 years ago
- Text generation with entities as context☆30Updated 6 years ago
- Scripts for preprocessing the CoNLL-2005 SRL dataset.☆23Updated 6 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- ☆15Updated 7 years ago
- Parser for Abstract Meaning Representation☆45Updated 4 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Official code of our NAACL 2019 paper on Zero-Shot Cross-Lingual Transfer with Order Differences☆18Updated 5 years ago
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- ☆18Updated 7 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"☆77Updated 3 years ago
- Assessing syntactic abilities of BERT☆148Updated 5 years ago