hltdi / HornMorpho
Morphological processing for languages of the Horn of Africa
☆45Updated 3 months ago
Alternatives and similar repositories for HornMorpho
Users who are interested in HornMorpho are comparing it to the libraries listed below
- Different semantic models for Amharic☆19Updated last year
- Lexical Data of Ge'ez Languages☆54Updated 2 years ago
- Amharic English Machine Translation Corpus prepared through website crawling and custom preprocessing.☆43Updated 6 years ago
- Amharic/Tigrinya/Oromo Dictionaries☆38Updated last year
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 3 years ago
- SIGTYP 2022 Shared Task☆9Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆26Updated 2 years ago
- A lexicon compiler for non-suffixational morphologies☆12Updated last month
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆13Updated 2 years ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated last year
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Updated 8 years ago
- Helsinki Finite-State Technology (library and application suite)☆129Updated 3 weeks ago
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g…☆20Updated 7 years ago
- An Amharic News Text classification Dataset☆37Updated 11 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated this week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆44Updated 2 years ago
- Simple bs4-based web crawl for a corpus in need of statistical machine translation☆13Updated 3 years ago
- ☆15Updated 5 years ago
- Tools and scripts for working with ELAN☆10Updated 2 years ago
- Python Finite-State Toolkit☆54Updated 2 months ago
- PHOIBLE data and development.☆125Updated 10 months ago
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i…☆36Updated last year
- A repository for the 2022 Inflection Shared Task☆9Updated 2 years ago
- ☆19Updated 3 years ago
- A character-wise tokenizer for morphologically rich languages☆27Updated 2 months ago
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆10Updated last year
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated 10 months ago