bonaventuredossou / MLM_ALLinks
☆23Updated last year
Alternatives and similar repositories for MLM_AL
Users that are interested in MLM_AL are comparing it to the libraries listed below
Sorting:
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆12Updated 3 years ago
- ☆17Updated 2 years ago
- MAFAND-MT☆60Updated last year
- POS for African languages☆19Updated 6 months ago
- Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.☆44Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated last year
- All our community docs! Start here! Lets put Africa on the NLP Map☆64Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- Crosslingual Question Answering for African Languages☆30Updated last year
- ☆127Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆281Updated last year
- ☆12Updated 10 months ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆59Updated 10 months ago
- Tunisian Arabish Corpus☆11Updated last year
- ☆116Updated 2 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 5 months ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆81Updated 2 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆90Updated 6 months ago
- Building an effective preprocessing tool for African languages☆13Updated last year
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆105Updated 9 months ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 3 years ago
- COMET for African languages☆10Updated 11 months ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- This repo is for semantic search app to search over Quran tafsir books☆24Updated last year
- build gpt-index using chatgpt and sentence-transformers☆14Updated 2 years ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding☆93Updated last year
- customer care chatbot made with Rasa Open Source.☆43Updated 3 years ago