nelson-liu / flatten_gigawordLinks
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
☆23Updated 8 years ago
Alternatives and similar repositories for flatten_gigaword
Users that are interested in flatten_gigaword are comparing it to the libraries listed below
Sorting:
- Various utility scripts useful for natural language processing, machine translation, etc.☆50Updated 3 years ago
- Evaluating Text Representations on Lexical Composition☆24Updated 6 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆94Updated 7 years ago
- Evaluating recurrent neural networks on predicting subject-verb agreement dependencies☆63Updated 2 years ago
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆155Updated 4 years ago
- This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 …☆98Updated 5 years ago
- Improving cross-lingual word embeddings by meeting in the middle☆23Updated 5 years ago
- ☆15Updated 7 years ago
- Appraise evaluation system for manual evaluation of machine translation output☆77Updated 4 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆204Updated 2 years ago
- Text Simplification System and Dataset☆125Updated 2 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019)☆63Updated 3 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 5 years ago
- Heuristic Analysis for NLI Systems☆127Updated 4 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 3 years ago
- Assessing syntactic abilities of BERT☆149Updated 6 years ago
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 6 years ago
- The Benchmark of Linguistic Minimal Pairs☆159Updated 3 years ago
- Tools for downloading and analyzing summaries and evaluating summarization systems. https://summari.es/☆151Updated 2 years ago
- Implementation of "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"☆77Updated 4 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 8 years ago
- ☆25Updated 3 years ago
- ☆231Updated 4 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆125Updated 8 years ago
- Official code of our NAACL 2019 paper on Zero-Shot Cross-Lingual Transfer with Order Differences☆18Updated 6 years ago
- ☆29Updated last year
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only☆50Updated last year
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- TER-plus Machine Translation metric.☆31Updated 3 years ago
- ☆165Updated 3 years ago