moses-smt / mgiza
A word alignment tool based on the famous GIZA++, extended to support multi-threading, resuming training, and incremental training.
☆166May 12, 2021Updated 4 years ago
Alternatives and similar repositories for mgiza
Users who are interested in mgiza are comparing it to the libraries listed below
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al…☆273Nov 18, 2025Updated 2 months ago
- Simple, fast unsupervised word aligner☆766Jul 19, 2022Updated 3 years ago
- Moses, the machine translation system☆1,623Mar 28, 2025Updated 10 months ago
- Scripts to preprocess training and test data and to run fast_align and giza☆107Nov 2, 2021Updated 4 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Jun 23, 2014Updated 11 years ago
- Fork of http://nlg.isi.edu/software/nplm/ with some efficiency tweaks and adaptation for use in mosesdecoder.☆13Sep 3, 2015Updated 10 years ago
- Efficient Low-Memory Aligner☆146Jan 15, 2025Updated last year
- Pipelined quality estimation.☆51Aug 13, 2019Updated 6 years ago
- Neural machine translation soft alignment visualisations for web and command line☆72Aug 19, 2021Updated 4 years ago
- Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) contex…☆185May 26, 2020Updated 5 years ago
- C++ implementation of HPYLM☆11May 2, 2017Updated 8 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- lamtram: A toolkit for neural language and translation modeling☆142Apr 16, 2018Updated 7 years ago
- Latent-variable Synchronous Context-Free Grammar Toolkit☆10Sep 30, 2014Updated 11 years ago
- Machine Translation Evaluation Metric☆39Dec 6, 2017Updated 8 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆42Sep 6, 2025Updated 5 months ago
- eXtensible Neural Machine Translation☆186Sep 22, 2025Updated 4 months ago
- Collection of Evaluation Metrics and Algorithms for Machine Translation☆76Mar 5, 2018Updated 7 years ago
- CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++☆21Oct 28, 2018Updated 7 years ago
- pialign - A Phrasal ITG Aligner☆24Apr 29, 2019Updated 6 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆495Feb 6, 2026Updated last week
- Transition-based dependency parser based on stack LSTMs☆206Nov 17, 2019Updated 6 years ago
- Lightweight C++ translator for OpenNMT Torch models (deprecated)☆81Apr 7, 2020Updated 5 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,263Aug 7, 2024Updated last year
- A neural word aligner based on multilingual BERT☆370Mar 10, 2022Updated 3 years ago
- ☆26Jan 9, 2023Updated 3 years ago
- Machine translation for the real world☆23Jan 22, 2020Updated 6 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- The Berkeley Word Aligner☆23Mar 24, 2016Updated 9 years ago
- Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better …☆204Feb 25, 2023Updated 2 years ago
- Fast Neural Machine Translation in C++☆1,420Aug 25, 2023Updated 2 years ago
- Learn Classical Statistical Machine Translation Systems.☆18May 27, 2020Updated 5 years ago
- A repository for experiments in quality-aware decoding☆18Jun 7, 2022Updated 3 years ago
- bilingual dictionary extractor from parallel corpora☆23Jul 3, 2014Updated 11 years ago
- A C++ toolkit for neural machine translation for CPU☆88Jun 11, 2019Updated 6 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆387Nov 7, 2023Updated 2 years ago
- Open-Source Neural Machine Translation in Tensorflow☆802Dec 9, 2022Updated 3 years ago
- A tool for holistic analysis of language generation systems☆471Sep 22, 2025Updated 4 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year