FrancisGregoire / parSentExtractLinks
A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.
☆33Updated 7 years ago
Alternatives and similar repositories for parSentExtract
Users that are interested in parSentExtract are comparing it to the libraries listed below
Sorting:
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 6 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)☆123Updated 2 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- Resources for the OpenNMT hackathon☆51Updated 6 years ago
- Neural models and instructions on how to reproduce our results for our neural grammatical error correction systems from M. Junczys-Dowmun…☆88Updated 6 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Updated 10 years ago
- eXtensible Neural Machine Translation☆185Updated 2 months ago
- TER-plus Machine Translation metric.☆31Updated 3 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆119Updated 4 years ago
- Efficient Markov Chain word alignment☆52Updated 4 years ago
- Language modeling scripts based on TensorFlow☆58Updated 6 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆204Updated 2 years ago
- ☆34Updated 7 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University☆34Updated 6 years ago
- Decoding platform for machine translation research☆54Updated 6 years ago
- DRESS simplification model (EMNLP 2017) described in http://aclweb.org/anthology/D/D17/D17-1062.pdf☆154Updated 4 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆125Updated 8 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆165Updated 4 years ago
- Source code for the paper "Morphological Inflection Generation with Hard Monotonic Attention"☆38Updated 7 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 4 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 7 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 11 years ago
- Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- Neural quality estimation toolkit for grammatical error correction and other language generation applications.☆49Updated 6 years ago
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- ☆23Updated 8 years ago
- A repository linking to publicly available dialog datasets. Feel free to send pull requests.☆69Updated 3 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆79Updated 2 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 5 years ago