FrancisGregoire / parSentExtractLinks
A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.
☆33Updated 6 years ago
Alternatives and similar repositories for parSentExtract
Users that are interested in parSentExtract are comparing it to the libraries listed below
Sorting:
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 6 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Resources for the OpenNMT hackathon☆51Updated 6 years ago
- Neural models and instructions on how to reproduce our results for our neural grammatical error correction systems from M. Junczys-Dowmun…☆88Updated 6 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)☆122Updated 2 years ago
- TER-plus Machine Translation metric.☆31Updated 3 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 4 years ago
- Dynamic data selection for neural machine translation☆20Updated 7 years ago
- ☆34Updated 8 years ago
- Pipelined quality estimation.☆51Updated 5 years ago
- Source code for the paper "Morphological Inflection Generation with Hard Monotonic Attention"☆38Updated 7 years ago
- Language modeling scripts based on TensorFlow☆58Updated 5 years ago
- eXtensible Neural Machine Translation☆187Updated 5 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆119Updated 4 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆204Updated 2 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 8 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆165Updated 4 years ago
- ☆55Updated 9 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 10 years ago
- ☆34Updated 7 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 6 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 6 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆77Updated 2 years ago
- Text Simplification System and Dataset☆122Updated 2 years ago
- ☆23Updated 8 years ago
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆249Updated 9 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆70Updated 6 years ago
- Neural network sequence labeling model☆250Updated 6 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 4 years ago