daormar / thot
Thot toolkit for statistical machine translation
☆53 · Updated 2 years ago
Alternatives and similar repositories for thot:
Users who are interested in thot compare it to the libraries listed below.
- Code for morphological transformations ☆29 · Updated 7 years ago
- ☆21 · Updated 10 years ago
- ☆12 · Updated 9 years ago
- Fast Word Clustering Software ☆78 · Updated 3 months ago
- Fast and robust NLP components implemented in Java. ☆52 · Updated 4 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface) ☆33 · Updated 3 years ago
- Appraise evaluation system for manual evaluation of machine translation output ☆74 · Updated 4 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 · Updated 2 years ago
- Hierarchical phrase-based machine translation system ☆32 · Updated 10 years ago
- Open-source tools for morphological tagging, segmentation and stemming. ☆40 · Updated 5 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation ☆15 · Updated 8 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 · Updated last year
- C++ implementation of Generalised Brown clustering and python scripts for feature generation ☆41 · Updated 9 years ago
- Offline extractor of synchronous context-free grammars for machine translation. ☆31 · Updated 9 years ago
- Corpus preprocessing ☆96 · Updated last year
- Tools for extracting parallel corpora from article titles across languages in Wikipedia ☆73 · Updated 10 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines. ☆77 · Updated 2 years ago
- Machine translation for the real world ☆23 · Updated 5 years ago
- Helsinki Neural Machine Translation system ☆28 · Updated 4 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies ☆70 · Updated 6 years ago
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 · Updated 6 years ago
- A tool for text normalisation via character-level machine translation ☆13 · Updated 4 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings. ☆16 · Updated 8 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 · Updated 2 years ago
- Translation Error Rate (TER) ☆43 · Updated 6 years ago
- LSTM Language Model with Subword Units Input Representations ☆42 · Updated 3 years ago
- ☆27 · Updated 8 years ago
- Tool for comparison and evaluation of machine translation. ☆56 · Updated 2 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 · Updated 4 years ago
- ☆23 · Updated 7 years ago