bonaventuredossou / MLM_AL
☆19Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for MLM_AL
- MAFAND-MT☆54Updated 4 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆18Updated 6 months ago
- POS for African languages☆17Updated 9 months ago
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆11Updated 2 years ago
- Crosslingual Question Answering for African Languages☆29Updated last month
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆66Updated 2 years ago
- ☆16Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆15Updated 3 months ago
- ☆105Updated 11 months ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆54Updated 7 months ago
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆41Updated 3 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆14Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆92Updated 6 months ago
- Curate online wolof text resources that can be used to build models☆21Updated 4 months ago
- build gpt-index using chatgpt and sentence-transformers☆13Updated last year
- Python intefrace for evaluation on chatgpt models☆19Updated 9 months ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆31Updated 10 months ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 3 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆46Updated 10 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆52Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆64Updated this week
- The official implementation of CATT Arabic diacritization models.☆35Updated 3 months ago
- ☆40Updated last year
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆13Updated last year
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 3 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated 4 months ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆23Updated 3 years ago
- Finetuning Whisper ASR model for Belarusian language☆14Updated last year