farshadjafari / parallel_corpus_generator
A Python application that generates parallel corpora for any language pair; the output can be used to train NMT (Neural Machine Translation) systems.
☆12Updated 3 years ago
Alternatives and similar repositories for parallel_corpus_generator
Users who are interested in parallel_corpus_generator are comparing it to the libraries listed below.
- Sentence aligner☆124Updated 4 years ago
- ☆16Updated 2 years ago
- The Open Parallel Corpus☆82Updated 3 weeks ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆54Updated 2 years ago
- Efficient teacher-student models and scripts to make them☆54Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆34Updated 7 months ago
- Multilingual sentence alignment using sentence embeddings☆139Updated last year
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆80Updated 8 months ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆42Updated 5 months ago
- A list of awesome Machine Translation frameworks, libraries, software and papers☆197Updated last year
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆18Updated last month
- ☆81Updated last week
- Open information and community for machine translation☆80Updated last week
- Finite-state script normalization and processing utilities☆46Updated 3 weeks ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆37Updated this week
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆24Updated 2 years ago
- Fast Neural Machine Translation in C++ - development repository☆284Updated 7 months ago
- Translation demonstrator☆37Updated 5 years ago
- UniParse: A universal graph-based parsing toolkit☆10Updated 6 years ago
- ☆42Updated 7 years ago
- ☆19Updated 4 years ago
- Bilingual sentence aligner☆28Updated 2 months ago
- Efficient Low-Memory Aligner☆146Updated last year
- OpusFilter - Parallel corpus processing toolkit☆115Updated this week
- This packages up data for the Open Multilingual Wordnet☆60Updated last week
- Transform TMX to text☆28Updated 3 years ago
- Bilingual sentence similarity classifier using Tensorflow☆24Updated 6 years ago
- Efficient Markov Chain word alignment☆52Updated 4 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆99Updated 2 years ago