bonaventuredossou / MLM_ALLinks
☆23Updated last year
Alternatives and similar repositories for MLM_AL
Users that are interested in MLM_AL are comparing it to the libraries listed below
Sorting:
- ☆17Updated 3 years ago
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆12Updated 3 years ago
- MAFAND-MT☆60Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- Crosslingual Question Answering for African Languages☆30Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- POS for African languages☆19Updated 7 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- ☆117Updated 3 months ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆65Updated last year
- COMET for African languages☆10Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated last year
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆59Updated 11 months ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 3 years ago
- ☆127Updated last year
- Building an effective preprocessing tool for African languages☆13Updated 2 years ago
- Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.☆44Updated last year
- AraT5: Text-to-Text Transformers for Arabic Language Understanding☆93Updated last year
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanaga…☆37Updated 2 years ago
- build gpt-index using chatgpt and sentence-transformers☆14Updated 2 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆134Updated 2 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆90Updated 6 months ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆105Updated 9 months ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆174Updated last month
- Arabic nested named entity recognition☆43Updated 10 months ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- ☆12Updated last year