masakhane-io / afriqaView external linksLinks
Crosslingual Question Answering for African Languages
☆30Sep 27, 2024Updated last year
Alternatives and similar repositories for afriqa
Users that are interested in afriqa are comparing it to the libraries listed below
Sorting:
- COMET for African languages☆10Jan 24, 2025Updated last year
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Jan 26, 2025Updated last year
- ☆17Jan 12, 2023Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24May 12, 2024Updated last year
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Aug 15, 2022Updated 3 years ago
- POS for African languages☆19Jun 25, 2025Updated 7 months ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Apr 26, 2024Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80May 31, 2022Updated 3 years ago
- Building an effective preprocessing tool for African languages☆13Jan 24, 2024Updated 2 years ago
- ☆117Oct 15, 2025Updated 3 months ago
- ☆13Feb 7, 2023Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- A list of scripts/notebooks I'd like to keep handy☆18Aug 15, 2024Updated last year
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Dec 16, 2025Updated last month
- Code for "MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER"☆48Jun 18, 2022Updated 3 years ago
- Multilingual Open Text☆25May 8, 2025Updated 9 months ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆29Jul 14, 2023Updated 2 years ago
- ☆23May 12, 2024Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆41Jan 27, 2026Updated 2 weeks ago
- AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…☆11Aug 22, 2024Updated last year
- A tiny BERT for low-resource monolingual models☆31Dec 24, 2025Updated last month
- wolof-subtiles-generator permet de générer des sous-titres en wolof pour des fichiers audio et de créer des vidéos avec les sous-titres i…☆29Aug 27, 2023Updated 2 years ago
- ☆12Nov 3, 2024Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- 中古漢語(切韻音系)全拼及三拼☆32Mar 26, 2021Updated 4 years ago
- ☆11Feb 4, 2018Updated 8 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆41Oct 13, 2022Updated 3 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- ☆263Aug 1, 2025Updated 6 months ago
- Masakhane Web is a translation web application for solely African Languages.☆37Aug 11, 2023Updated 2 years ago
- Do Multilingual Language Models Think Better in English?☆42Aug 3, 2023Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- ☆14Feb 13, 2021Updated 5 years ago
- Products Information Portal and Microservices☆13Sep 17, 2025Updated 4 months ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- Python wrapper for the energy system optimization framework IESopt.☆18Jan 26, 2026Updated 2 weeks ago
- ☆12Sep 27, 2024Updated last year