geezorg / data
Lexical Data of Ge'ez Languages
☆52Updated 2 years ago
Alternatives and similar repositories for data:
Users that are interested in data are comparing it to the libraries listed below
- Different semantic models for Amharic☆17Updated last year
- Morphological processing for languages of the Horn of Africa☆43Updated this week
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus i…☆33Updated last year
- Amharic/Tigrinya/Oromo Dictionaries☆37Updated last year
- An Amharic News Text classification Dataset☆37Updated 8 months ago
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆11Updated last year
- Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.☆40Updated 6 years ago
- eBooks in Development and Completed☆25Updated last month
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g…☆19Updated 7 years ago
- A library for generating Ethiopic fake data such as names, addresses, and phone numbers☆16Updated 6 years ago
- ☆15Updated 5 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- ☆11Updated 3 years ago
- simple bs4 based web crawl for a corpus in need of statistical machine translation☆13Updated 3 years ago
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Updated 7 years ago
- The set of files used for the development of the Amharic Corpus.☆11Updated 7 years ago
- OpenITI releases☆29Updated last year
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆32Updated 3 weeks ago
- Arabic News☆12Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- List of arabic names (in both english and arabic letters) with their gender☆39Updated 10 years ago
- About 6,500 Irish lemmas ordered by corpus frequency, with noise removed.☆33Updated 6 years ago
- jQuery plugin for Amharic keyboard support online☆12Updated 9 years ago
- A comprehensive list of Arabic NLP resources.☆19Updated last month
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)☆104Updated 7 years ago
- List of Malay words in both Rumi and Jawi scripts☆17Updated 3 years ago
- ☆12Updated 2 years ago
- 📝A text file containing 150,000 Urdu words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion.☆44Updated 4 years ago
- Aksharamukha Python Library☆44Updated 3 months ago