krzwolk / Glosbe-Translation-Memory-Crawler
parallel corpora for any languages supported by glosbe.com
☆10Updated 9 years ago
Alternatives and similar repositories for Glosbe-Translation-Memory-Crawler
Users who are interested in Glosbe-Translation-Memory-Crawler are comparing it to the libraries listed below.
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- ☆12Updated 9 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆77Updated 2 years ago
- Fork of http://nlg.isi.edu/software/nplm/ for threadsafety and efficiency.☆18Updated 11 years ago
- Pipelined quality estimation.☆51Updated 5 years ago
- ☆42Updated 6 years ago
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- Translation Error Rate (TER)☆44Updated 7 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆69Updated last week
- Translation Memory Open-source Purifier☆34Updated 2 years ago
- Collaborative on-line editor for aligned parallel texts.☆13Updated 3 years ago
- Hierarchical phrase-based machine translation system☆32Updated 10 years ago
- ☆34Updated 8 years ago
- Read-only unofficial mirror of Pynini☆17Updated 6 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Updated 11 years ago
- bilingual dictionary extractor from parallel corpora☆22Updated 11 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- Efficient Low-Memory Aligner☆145Updated 5 months ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- Bitextor generates translation memories from multilingual websites☆293Updated 7 months ago
- Translation demonstrator☆34Updated 5 years ago
- A word alignment tool based on the well-known GIZA++, extended to support multi-threading, resumed training, and incremental training.☆164Updated 4 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 10 months ago
- Corpus preprocessing☆97Updated last year
- ☆22Updated 5 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 5 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- repo for Tibetan corpora☆21Updated 2 years ago
- Neural machine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- Transform TMX to text☆27Updated 2 years ago