hltdi / HornMorpho
Morphological processing for languages of the Horn of Africa
☆43, updated this week
Alternatives and similar repositories for HornMorpho:
Users who are interested in HornMorpho are comparing it to the libraries listed below:
- Lexical Data of Ge'ez Languages ☆52, updated 2 years ago
- Amharic/Tigrinya/Oromo Dictionaries ☆37, updated last year
- Amharic English Machine Translation Corpus prepared through website crawling and custom preprocessing. ☆40, updated 6 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters ☆19, updated 2 years ago
- Different semantic models for Amharic ☆17, updated last year
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i… ☆33, updated last year
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be … ☆15, updated 10 months ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆39, updated 2 years ago
- SIGTYP 2022 Shared Task ☆9, updated 2 years ago
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya ☆11, updated 7 years ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities ☆11, updated last year
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g… ☆19, updated 7 years ago
- An Amharic News Text classification Dataset ☆37, updated 8 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34, updated last year
- A library for generating Ethiopic fake data such as names, addresses, and phone numbers ☆16, updated 6 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates ☆45, updated last year
- A lexicon compiler for non-suffixational morphologies ☆11, updated 3 weeks ago
- Simple bs4-based web crawl for a corpus in need of statistical machine translation ☆13, updated 3 years ago
- CLDF: Cross-Linguistic Data Formats - the specification ☆56, updated 9 months ago
- The Unicode Cookbook for Linguists ☆53, updated 4 years ago
- PHOIBLE data and development. ☆122, updated 6 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆27, updated 3 years ago
- Arabic Transliteration in Python ☆34, updated 11 years ago
- A repository for the 2022 Inflection Shared Task ☆9, updated 2 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆15, updated 7 months ago
- The Metadata Editor for Transparent Archiving of language document materials ☆20, updated last week
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆28, updated last month
- The curation repository for the data behind Concepticon. ☆37, updated this week
- Cross-Linguistic Transcription Systems ☆14, updated last month
- Automatically exported from code.google.com/p/foma ☆119, updated 6 months ago