rasyosef / amharic-news-category-classificationLinks
notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification dataset and the transformers library
☆10Updated last year
Alternatives and similar repositories for amharic-news-category-classification
Users that are interested in amharic-news-category-classification are comparing it to the libraries listed below
Sorting:
- Different semantic models for Amharic☆21Updated last year
- An Amharic News Text classification Dataset☆38Updated last year
- AmQA - The first Amharic Open Domain Question Answering Dataset☆12Updated last year
- Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities☆13Updated last month
- ☆17Updated 2 years ago
- Morphological processing for languages of the Horn of Africa☆46Updated this week
- ☆12Updated 3 years ago
- Amharic/Tigrinya/Oromo Dictionaries☆38Updated last year
- A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus i…☆36Updated 2 years ago
- Lexical Data of Ge'ez Languages☆54Updated 2 years ago
- GEMBA — GPT Estimation Metric Based Assessment☆119Updated 11 months ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆84Updated 5 months ago
- Multicultural Proverbs and Sayings☆11Updated 6 months ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated last year
- MAFAND-MT☆57Updated last year
- TODa: Tamazight Open Dataset☆16Updated 6 months ago
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆13Updated 6 months ago
- Research code for pixel-based encoders of language (PIXEL)☆337Updated this week
- Arabic Tokenization Library. It provides many tokenization algorithms.☆107Updated last year
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆85Updated last year
- This repository contains an extension of fairseq for pixel / visual representations for machine translation.☆36Updated last year
- Code for Arabic Nougat☆42Updated 7 months ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 2 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆10Updated 2 years ago
- Crosslingual Reasoning through Test-Time Scaling☆18Updated 2 months ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆22Updated 5 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- Arabic Wikipedia Extracts☆13Updated 3 years ago
- ☆13Updated last week
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated last year