MarsPanther / Amharic-English-Machine-Translation-Corpus
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
☆43Updated 6 years ago
Alternatives and similar repositories for Amharic-English-Machine-Translation-Corpus:
Users that are interested in Amharic-English-Machine-Translation-Corpus are comparing it to the libraries listed below
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆13Updated 2 years ago
- Different semantic models for Amharic☆19Updated last year
- An Amharic News Text classification Dataset☆37Updated 11 months ago
- Amharic/Tigrinya/Oromo Dictionaries☆38Updated last year
- Morphological processing for languages of the Horn of Africa☆45Updated 3 months ago
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus i…☆35Updated last year
- Lexical Data of Ge'ez Languages☆54Updated 2 years ago
- Amharic speech recognition using Deep Learning☆20Updated 5 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 3 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- ☆15Updated 5 years ago
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆10Updated 11 months ago
- simple bs4 based web crawl for a corpus in need of statistical machine translation☆13Updated 3 years ago
- The set of files used for the development of the Amharic Corpus.☆11Updated 7 years ago
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g…☆19Updated 7 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆15Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated 10 months ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated 10 months ago
- AmQA - The first Amharic Open Domain Question Answering Dataset☆12Updated 10 months ago
- ☆42Updated 7 years ago
- ☆14Updated 4 years ago
- Efficient Low-Memory Aligner☆143Updated 3 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆154Updated 3 weeks ago
- Python Finite-State Toolkit☆54Updated last month
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆41Updated 2 years ago
- Transformer based translation quality estimation☆110Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆25Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated 3 weeks ago
- Improved Sentence Alignment in Linear Time and Space☆169Updated 2 years ago
- ☆72Updated 3 weeks ago