hltdi / HornMorpho
Morphological processing for languages of the Horn of Africa
☆45Updated 2 months ago
Alternatives and similar repositories for HornMorpho:
Users who are interested in HornMorpho are comparing it to the libraries listed below.
- Different semantic models for Amharic☆17Updated last year
- Lexical Data of Ge'ez Languages☆54Updated 2 years ago
- Amharic English Machine Translation Corpus prepared through website crawling and custom preprocessing.☆42Updated 6 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 3 years ago
- Amharic/Tigrinya/Oromo Dictionaries☆38Updated last year
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g…☆19Updated 7 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆12Updated last year
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Updated 8 years ago
- An Amharic News Text classification Dataset☆37Updated 10 months ago
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i…☆34Updated last year
- SIGTYP 2022 Shared Task☆9Updated 2 years ago
- A lexicon compiler for non-suffixational morphologies☆12Updated 2 months ago
- Simple bs4-based web crawl for a corpus in need of statistical machine translation☆13Updated 3 years ago
- Amharic speech recognition using Deep Learning☆19Updated 5 years ago
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- ☆19Updated 3 years ago
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆20Updated 5 years ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆25Updated 2 years ago
- A library for generating Ethiopic fake data such as names, addresses, and phone numbers☆16Updated 6 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- ThamizhiMorph: A Tamil Morphological Analyser and Generator☆17Updated last year
- ☆42Updated 7 years ago
- Python Finite-State Toolkit☆53Updated last month
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- The set of files used for the development of the Amharic Corpus.☆11Updated 7 years ago
- The curation repository for the data behind Concepticon.☆38Updated last month
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆103Updated 11 months ago
- PHOIBLE data and development.☆122Updated 8 months ago