marian-nmt / marian-dev
Fast Neural Machine Translation in C++ - development repository
☆257Updated last month
Related projects ⓘ
Alternatives and complementary repositories for marian-dev
- Fast and customizable text tokenization library with BPE and SentencePiece support☆284Updated 2 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆150Updated 5 months ago
- Fast Neural Machine Translation in C++☆1,255Updated last year
- Bitextor generates translation memories from multilingual websites☆291Updated last week
- Corpus preprocessing☆95Updated 8 months ago
- Fast BPE☆656Updated 5 months ago
- int8_t and int16_t matrix multiply based on https://arxiv.org/abs/1705.01991☆63Updated 10 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- Simple, fast unsupervised word aligner☆738Updated 2 years ago
- eXtensible Neural Machine Translation☆185Updated 4 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆488Updated 5 months ago
- Efficient Low-Memory Aligner☆139Updated 2 months ago
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- Lightweight C++ translator for OpenNMT Torch models (deprecated)☆79Updated 4 years ago
- Efficient teacher-student models and scripts to make them☆48Updated 11 months ago
- ☆42Updated 6 years ago
- scripts and configuration files for Edinburgh neural MT submission to WMT 16 shared translation task☆138Updated 4 years ago
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆78Updated last year
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆161Updated 3 years ago
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆220Updated last year
- OpusFilter - Parallel corpus processing toolkit☆102Updated 3 months ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆45Updated 6 months ago
- Sentence aligner☆108Updated 3 years ago
- Automatic extraction of edited sentences from text edition histories.☆81Updated 2 years ago
- C++/CUDA toolkit for training sequence and sequence-to-sequence models across multiple GPUs☆186Updated 7 years ago
- Collection of Evaluation Metrics and Algorithms for Machine Translation☆76Updated 6 years ago
- Tool for comparison and evaluation of machine translation.☆56Updated 2 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆431Updated 2 years ago