AAUThematic4LT / Parallel-Corpora-for-Ethiopian-Languages
☆15 Updated 5 years ago
Alternatives and similar repositories for Parallel-Corpora-for-Ethiopian-Languages:
Users who are interested in Parallel-Corpora-for-Ethiopian-Languages are comparing it to the libraries listed below:
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities ☆12 Updated last year
- Different semantic models for Amharic ☆17 Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆39 Updated 2 years ago
- Amharic English Machine Translation Corpus prepared through website crawling and custom preprocessing. ☆42 Updated 6 years ago
- Simple bs4-based web crawl for a corpus in need of statistical machine translation ☆13 Updated 3 years ago
- Lexical Data of Ge'ez Languages ☆54 Updated 2 years ago
- Morphological processing for languages of the Horn of Africa ☆45 Updated 2 months ago
- Low-Resource Neural Machine Translation: A Benchmark for Five African Languages ☆15 Updated 4 years ago
- ☆42 Updated 3 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer ☆20 Updated 5 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis… ☆14 Updated 2 years ago
- An Amharic News Text classification Dataset ☆37 Updated 10 months ago
- Bilingual term extractor ☆53 Updated last year
- ☆49 Updated 3 years ago
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks. ☆74 Updated last year
- OpusFilter - Parallel corpus processing toolkit ☆104 Updated this week
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆103 Updated 11 months ago
- Machine Translation (MT) Preparation Scripts ☆31 Updated last month
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆48 Updated last year
- Pre-process Arabic text (remove diacritics, punctuation and repeating characters) ☆106 Updated 7 years ago
- ☆23 Updated 5 years ago
- Amharic/Tigrinya/Oromo Dictionaries ☆38 Updated last year
- BERT for Arabic Topic Modeling: An Experimental Study on BERTopic Technique ☆27 Updated 3 years ago
- ☆17 Updated 2 years ago
- This is a monolingual English corpus of native, non-native and (human) translated texts extracted from the European Parliament. ☆9 Updated 3 years ago
- Open information and community for machine translation ☆74 Updated last week
- ☆14 Updated 4 years ago
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i… ☆34 Updated last year
- ☆25 Updated last year
- Use Python and NLTK to build out your own text classifiers and solve common NLP problems ☆47 Updated 5 years ago