uhh-lt / ethiopicmodels
Different semantic models for Amharic
☆21 Updated last year
Alternatives and similar repositories for ethiopicmodels
Users who are interested in ethiopicmodels are comparing it to the repositories listed below.
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities ☆13 Updated 2 months ago
- Notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas… ☆10 Updated last year
- Lexical Data of Ge'ez Languages ☆54 Updated 2 years ago
- Morphological processing for languages of the Horn of Africa ☆46 Updated 2 weeks ago
- AmQA - The first Amharic Open Domain Question Answering Dataset ☆12 Updated last year
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover, Lexical analyzer, Corpus i… ☆36 Updated 2 years ago
- An Amharic News Text classification Dataset ☆38 Updated last year
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation. ☆10 Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆106 Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023) ☆74 Updated 4 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures. ☆84 Updated 6 months ago
- Amharic/Tigrinya/Oromo Dictionaries ☆38 Updated 2 years ago
- A comprehensive list of Arabic NLP resources. ☆34 Updated 2 months ago
- Improved Sentence Alignment in Linear Time and Space ☆180 Updated 2 years ago
- ☆14 Updated last week
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆30 Updated last month
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆47 Updated 2 years ago
- ☆17 Updated 2 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub. ☆11 Updated last year
- A multilingual parallel corpus created from translations of the Bible. ☆183 Updated 2 months ago
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons. ☆16 Updated 10 months ago
- ☆110 Updated last year
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024 ☆19 Updated last year
- MAFAND-MT ☆57 Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆26 Updated 2 years ago
- BRAD: Books Reviews in Arabic Dataset ☆15 Updated 7 years ago
- ☆14 Updated 4 years ago
- ☆49 Updated last year
- Arabic edition of BERT pretrained language models ☆130 Updated 4 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆373 Updated last year