MarsPanther / Amharic-English-Machine-Translation-Corpus
Amharic-English machine translation corpus prepared through website crawling and custom preprocessing.
☆43 Updated 7 years ago
Alternatives and similar repositories for Amharic-English-Machine-Translation-Corpus
Users who are interested in Amharic-English-Machine-Translation-Corpus are comparing it to the libraries listed below.
- Morphological processing for languages of the Horn of Africa ☆46 Updated last week
- Punctuation restoration and spell correction experiments. ☆250 Updated 4 years ago
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi ☆16 Updated last year
- A curated list of research papers and resources on code-switching ☆326 Updated 9 months ago
- 📝 An easy-to-use package to restore punctuation of the text. ☆118 Updated 2 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data. ☆84 Updated 5 years ago
- Bitextor generates translation memories from multilingual websites ☆295 Updated 10 months ago
- 📃 Language Model based sentences scoring library ☆309 Updated 3 years ago
- Python library for converting numbers to words for all Indian Languages. ☆36 Updated 4 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆158 Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆376 Updated last year
- A tool for converting TMX files into bilingual corpora ☆18 Updated 5 years ago
- A neural word aligner based on multilingual BERT ☆357 Updated 3 years ago
- Massively multilingual pronunciation mining ☆349 Updated last month
- A sentence segmenter that actually works! ☆305 Updated 5 years ago
- ☆45 Updated 3 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages ☆220 Updated last year
- Machine-Translation-based sentence alignment tool for parallel text ☆312 Updated 4 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆225 Updated last year
- Automatically exported from code.google.com/p/m2m-aligner ☆42 Updated 9 years ago
- ☆45 Updated 7 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features. ☆276 Updated 3 months ago
- ☆24 Updated 4 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet) ☆749 Updated 2 months ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆40 Updated 2 years ago
- Improved Sentence Alignment in Linear Time and Space ☆181 Updated 2 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆28 Updated 2 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences. ☆450 Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆47 Updated 2 years ago
- Transformer based translation quality estimation ☆113 Updated 2 years ago