MarsPanther / Amharic-English-Machine-Translation-Corpus
Amharic-English machine translation corpus prepared through website crawling and custom preprocessing.
☆43 Updated 7 years ago
Alternatives and similar repositories for Amharic-English-Machine-Translation-Corpus
Users who are interested in Amharic-English-Machine-Translation-Corpus compare it to the libraries listed below.
- Morphological processing for languages of the Horn of Africa ☆52 Updated last month
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆40 Updated 3 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet) ☆790 Updated last month
- 📃Language Model based sentences scoring library ☆309 Updated 3 years ago
- Different semantic models for Amharic ☆21 Updated 2 years ago
- ☆48 Updated 8 years ago
- Punctuation restoration and spell correction experiments. ☆252 Updated 4 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features. ☆291 Updated 3 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆386 Updated 2 years ago
- Machine Translation for Africa ☆304 Updated 3 years ago
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi ☆16 Updated 2 years ago
- A comprehensive list of Hebrew NLP resources. ☆282 Updated 8 months ago
- Massively multilingual pronunciation mining ☆360 Updated 2 weeks ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆53 Updated 2 years ago
- A curated list of research papers and resources on code-switching ☆329 Updated last week
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆31 Updated 2 years ago
- Datasets and tools for basic natural language processing. ☆388 Updated 4 years ago
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org. ☆80 Updated 8 months ago
- Improved Sentence Alignment in Linear Time and Space ☆186 Updated 2 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆237 Updated last year
- A neural word aligner based on multilingual BERT ☆368 Updated 3 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer ☆21 Updated 6 years ago
- Machine-Translation-based sentence alignment tool for parallel text ☆313 Updated 4 years ago
- ☆23 Updated 4 years ago
- This is a Pytorch (+ Huggingface transformers) implementation of a "simple" text classifier defined using BERT-based models. In this lab … ☆19 Updated 4 years ago
- Useful resources for Mongolian NLP ☆197 Updated last year
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data. ☆85 Updated 6 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES ☆16 Updated 5 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆160 Updated last year
- ☆45 Updated 3 years ago