MAFAND-MT
☆61Jul 9, 2024Updated last year
Alternatives and similar repositories for lafand-mt
Users that are interested in lafand-mt are comparing it to the libraries listed below
Sorting:
- Crosslingual Question Answering for African Languages☆30Sep 27, 2024Updated last year
- ☆17Jan 12, 2023Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆25May 12, 2024Updated last year
- This repository contains multi-modal speech data for African languages that can be used to train ASR and NLP models☆17Aug 31, 2022Updated 3 years ago
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆18Apr 29, 2024Updated last year
- POS for African languages☆19Jun 25, 2025Updated 8 months ago
- Building an effective preprocessing tool for African languages☆13Jan 24, 2024Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆114Apr 26, 2024Updated last year
- ☆118Oct 15, 2025Updated 4 months ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- COMET for African languages☆10Jan 24, 2025Updated last year
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Jan 26, 2025Updated last year
- ☆23May 12, 2024Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆82May 31, 2022Updated 3 years ago
- ☆12Nov 9, 2025Updated 4 months ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆36Oct 14, 2025Updated 4 months ago
- wolof-subtiles-generator permet de générer des sous-titres en wolof pour des fichiers audio et de créer des vidéos avec les sous-titres i…☆29Aug 27, 2023Updated 2 years ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English☆16Oct 20, 2020Updated 5 years ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆13Aug 15, 2022Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆38Jul 31, 2025Updated 7 months ago
- Machine Translation for Africa☆312Jun 14, 2022Updated 3 years ago
- Bringing ChatGPT Plugins to All (LLMs and Humans)☆18May 20, 2023Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- build gpt-index using chatgpt and sentence-transformers☆14Apr 8, 2023Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆42Oct 13, 2022Updated 3 years ago
- Instruct-tuning LLaMA on consumer hardware with machine-translated data☆19Apr 17, 2023Updated 2 years ago
- Extracts plain text, language identification and more metadata from WARC records☆23Oct 1, 2025Updated 5 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- Source code for GlorIA models pre-training.☆21Apr 3, 2024Updated last year
- ☆26May 30, 2023Updated 2 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- ☆267Aug 1, 2025Updated 7 months ago
- ☆26Nov 23, 2023Updated 2 years ago
- ☆35Feb 10, 2025Updated last year
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆33Jun 24, 2023Updated 2 years ago