bonaventuredossou / MLM_ALLinks
☆22Updated last year
Alternatives and similar repositories for MLM_AL
Users that are interested in MLM_AL are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
- MAFAND-MT☆57Updated last year
- All our community docs! Start here! Lets put Africa on the NLP Map☆60Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆75Updated 3 years ago
- POS for African languages☆17Updated 2 months ago
- Crosslingual Question Answering for African Languages☆31Updated 11 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- ☆110Updated last year
- COMET for African languages☆10Updated 7 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆109Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆85Updated last month
- ☆124Updated last year
- Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.☆42Updated last year
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 3 years ago
- Machine Translation for Africa☆294Updated 3 years ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆32Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 8 months ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆80Updated 2 years ago
- Building an effective preprocessing tool for African languages☆13Updated last year
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆335Updated 8 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated last year
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆99Updated 4 months ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆130Updated last year
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 4 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆61Updated 10 months ago
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.☆17Updated 11 months ago
- A New Tamil Large Language Model (LLM) Based on Llama 2☆310Updated last year
- Place where folks can contribute to 🤗 community events☆425Updated last year