techiaith / docker-moses-smtLinks
Hwyluso cyfieithu peirianyddol MosesSMT i'r Gymraeg // Making MosesSMT machine translation easier for Welsh (and other languages)
☆16Updated 4 years ago
Alternatives and similar repositories for docker-moses-smt
Users that are interested in docker-moses-smt are comparing it to the libraries listed below
Sorting:
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- Efficient Markov Chain word alignment☆53Updated 4 years ago
- ☆12Updated 9 years ago
- ☆42Updated 7 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆197Updated 4 years ago
- Corpus preprocessing☆98Updated last year
- Appraise evaluation system for manual evaluation of machine translation output☆77Updated 4 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 3 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- Efficient Low-Memory Aligner☆146Updated 8 months ago
- Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) contex…☆185Updated 5 years ago
- ☆23Updated 8 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- ☆27Updated 8 years ago
- ☆21Updated 10 years ago
- Decoding platform for machine translation research☆55Updated 6 years ago
- A dataset of sentences with ordinal labels for grammaticality☆29Updated 11 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 7 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated last year
- Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018☆124Updated 5 years ago
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆70Updated 6 years ago
- Fast Word Clustering Software☆78Updated 7 months ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- Simple Wikipedia plain text extractor with article link annotations and Hadoop support.☆103Updated 14 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆165Updated 4 years ago
- The WebSplit Benchmark introducing "Split and Rephrase" task☆63Updated 6 years ago