bonaventuredossou / MLM_ALLinks
☆23Updated last year
Alternatives and similar repositories for MLM_AL
Users that are interested in MLM_AL are comparing it to the libraries listed below
Sorting:
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆12Updated 3 years ago
- ☆17Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- wolof-subtiles-generator permet de générer des sous-titres en wolof pour des fichiers audio et de créer des vidéos avec les sous-titres i…☆29Updated 2 years ago
- MAFAND-MT☆59Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆77Updated 3 years ago
- Building an effective preprocessing tool for African languages☆13Updated last year
- Crosslingual Question Answering for African Languages☆31Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- ☆12Updated 7 months ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆60Updated last year
- POS for African languages☆19Updated 3 months ago
- build gpt-index using chatgpt and sentence-transformers☆14Updated 2 years ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆59Updated 7 months ago
- ☆112Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆175Updated last year
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆103Updated 6 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 2 months ago
- Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.☆43Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 10 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- ☆12Updated last year
- Performing a RAG (Retrieval Augmented Generation) assessment using voice-to-voice query resolution. Provide the file containing the queri…☆45Updated last year
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated last year
- مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (AN…☆82Updated 2 weeks ago
- ☆125Updated last year
- COMET for African languages☆10Updated 8 months ago
- The official implementation of CATT Arabic diacritization models.☆54Updated 3 months ago
- Machine Translation for Africa☆296Updated 3 years ago