bonaventuredossou / MLM_AL
☆22Updated 11 months ago
Alternatives and similar repositories for MLM_AL:
Users that are interested in MLM_AL are comparing it to the libraries listed below
- ☆17Updated 2 years ago
- MAFAND-MT☆55Updated 9 months ago
- Crosslingual Question Answering for African Languages☆29Updated 7 months ago
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆11Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated 11 months ago
- COMET for African languages☆10Updated 3 months ago
- POS for African languages☆17Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆73Updated 2 years ago
- ☆12Updated last month
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆104Updated last year
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 2 years ago
- ☆110Updated last year
- Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper☆16Updated last year
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆20Updated 3 months ago
- ☆10Updated last year
- Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.☆41Updated 11 months ago
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 4 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆25Updated 3 years ago
- Building an effective preprocessing tool for African languages☆12Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆14Updated 11 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆19Updated 8 months ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated 10 months ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆15Updated 4 years ago
- ☆10Updated last year
- This repository houses materials consulted by the instructors☆11Updated 3 years ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆48Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆25Updated 4 months ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆12Updated last year
- A list of scripts/notebooks I'd like to keep handy☆16Updated 8 months ago