masakhane-io / lafand-mt
MAFAND-MT
☆60Jul 9, 2024Updated last year
Alternatives and similar repositories for lafand-mt
Users that are interested in lafand-mt are comparing it to the libraries listed below
- Crosslingual Question Answering for African Languages☆30Sep 27, 2024Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆24May 12, 2024Updated last year
- A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.☆18Apr 29, 2024Updated last year
- POS for African languages☆19Jun 25, 2025Updated 7 months ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Apr 26, 2024Updated last year
- Building an effective preprocessing tool for African languages☆13Jan 24, 2024Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- ☆117Oct 15, 2025Updated 4 months ago
- Masakhane Web is a translation web application solely for African languages.☆37Aug 11, 2023Updated 2 years ago
- COMET for African languages☆10Jan 24, 2025Updated last year
- ☆23May 12, 2024Updated last year
- ☆12Nov 9, 2025Updated 3 months ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆13Aug 15, 2022Updated 3 years ago
- MENYO-20k Corpus in "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" in MT Summit 2021☆13Jan 16, 2023Updated 3 years ago
- ☆53Dec 3, 2021Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆37Jul 31, 2025Updated 6 months ago
- build gpt-index using chatgpt and sentence-transformers☆14Apr 8, 2023Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆41Oct 13, 2022Updated 3 years ago
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆81May 23, 2025Updated 8 months ago
- Extracts plain text, language identification and more metadata from WARC records☆23Oct 1, 2025Updated 4 months ago
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- ☆52Jun 6, 2023Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Jan 10, 2024Updated 2 years ago
- scripts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- ☆26May 30, 2023Updated 2 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- ☆263Aug 1, 2025Updated 6 months ago
- ☆26Nov 23, 2023Updated 2 years ago
- AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…☆11Aug 22, 2024Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆33Jun 24, 2023Updated 2 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- Wolof is a library that you can use for specific NLP tasks in the Wolof language, e.g. text classification in Wolof, NMT, ASR☆31Nov 28, 2023Updated 2 years ago
- ☆32Feb 8, 2025Updated last year
- ☆12Nov 3, 2024Updated last year
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year