uhh-lt / ethiopicmodelsLinks
Different semantic models for Amharic
☆21Updated last year
Alternatives and similar repositories for ethiopicmodels
Users that are interested in ethiopicmodels are comparing it to the libraries listed below
Sorting:
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆16Updated 6 months ago
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆11Updated last year
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus i…☆37Updated 2 years ago
- An Amharic News Text classification Dataset☆38Updated last year
- Morphological processing for languages of the Horn of Africa☆51Updated 2 months ago
- Lexical Data of Ge'ez Languages☆54Updated 3 years ago
- AmQA - The first Amharic Open Domain Question Answering Dataset☆13Updated last year
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Updated 3 years ago
- Amharic/Tigrinya/Oromo Dictionaries☆38Updated 2 years ago
- Variationist: Exploring Multifaceted Variation and Bias in Written Language Data (ACL 2024 demo track)☆10Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.☆43Updated 7 years ago
- ☆15Updated last month
- ☆115Updated 2 months ago
- ☆17Updated 2 years ago
- ☆12Updated 3 years ago
- A comprehensive list of Arabic NLP resources.☆43Updated 3 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆52Updated 2 years ago
- Datasets for Hate Speech Detection☆134Updated 2 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Camel Morph’s goal is to build large open-source morphological models for Arabic and its dialects across many genres and domains.☆14Updated last year
- Benchmark Arabic text diacritization dataset☆76Updated 6 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 3 years ago
- Natural Language Processing Research in North American Linguistics Departments☆20Updated last month
- The Kurdish Language Processing Toolkit☆107Updated 4 months ago
- Amharic speech recognition using Deep Learning☆23Updated 6 years ago
- Multilingual sentence alignment using sentence embeddings☆131Updated last year
- An educational tool to train, inspect, evaluate and translate using neural engines☆19Updated 9 months ago
- This is a Pytorch (+ Huggingface transformers) implementation of a "simple" text classifier defined using BERT-based models. In this lab …☆19Updated 4 years ago