masakhane-io/afriqa

masakhane-io / afriqa

Crosslingual Question Answering for African Languages

☆31

Alternatives and similar repositories for afriqa

Users who are interested in afriqa are comparing it to the libraries listed below.

masakhane-io / africomet
COMET for African languages
☆11 · Jan 24, 2025 · Updated last year
dadelani / sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆26 · May 20, 2026 · Updated 2 months ago
masakhane-io / lafand-mt
MAFAND-MT
☆63 · Jul 9, 2024 · Updated 2 years ago
uds-lsv / afro-maft
☆17 · Jan 12, 2023 · Updated 3 years ago
gauthelo / kallaama-speech-dataset
A transcribed speech dataset in Wolof, Pulaar and Sereer, to support agriculture. Funded by Lacuna Fund.
☆20 · Mar 26, 2026 · Updated 4 months ago
dadelani / africanlp-resources
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
☆13 · Aug 15, 2022 · Updated 3 years ago
masakhane-io / masakhane-news
MasakhaNEWS: News Topic Classification for African Languages
☆26 · May 12, 2024 · Updated 2 years ago
castorini / afriberta
AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages
☆83 · May 31, 2022 · Updated 4 years ago
masakhane-io / masakhane-mt
Machine Translation for Africa
☆322 · Jun 14, 2022 · Updated 4 years ago
alpoktem / bible2speechDB
Scripts to create speech corpora from open.bible
☆13 · Jan 3, 2022 · Updated 4 years ago
masakhane-io / masakhane-ner
☆122 · Oct 15, 2025 · Updated 9 months ago
hgilles06 / infashai
☆16 · Feb 22, 2022 · Updated 4 years ago
WolofProcessing / online_wolof_data
Curate online Wolof text resources that can be used to build models
☆28 · Jun 25, 2026 · Updated last month
ylacombe / scripts_and_notebooks
A list of scripts/notebooks I'd like to keep handy
☆18 · Aug 15, 2024 · Updated last year
castorini / NanoKnow
☆16 · Jun 26, 2026 · Updated 3 weeks ago
masakhane-io / masakhane-pos
POS for African languages
☆21 · Jun 25, 2025 · Updated last year
cisnlp / Glot500
[ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
☆107 · Apr 14, 2026 · Updated 3 months ago
coqui-ai / data-checker
🫠 check your data, before you wreck your model
☆16 · Aug 11, 2022 · Updated 3 years ago
Mister-iks / ai_suggest_deployment
AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…
☆11 · Aug 22, 2024 · Updated last year
maria-antoniak / fight-harassment-in-research
☆17 · Aug 19, 2024 · Updated last year
nk2028 / commonly-used-chinese-characters-and-words
漢語常用字詞表 (List of commonly used Chinese characters and words)
☆16 · Jun 3, 2023 · Updated 3 years ago
asahi417 / lm-vocab-trimmer
Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…
☆67 · Oct 25, 2024 · Updated last year
GeneZC / MiniMoE
Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"
☆29 · Jul 14, 2023 · Updated 3 years ago
bltlab / seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
☆23 · Jul 16, 2026 · Updated last week
emorynlp / seq2seq-corenlp
☆13 · Feb 7, 2023 · Updated 3 years ago
WhiredPlanck / trime-new
新·同文安卓輸入法平臺 3.x / Rime Input Method Engine for Android
☆20 · Updated this week
Yinghao-Li / CHMM-ALT
Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"
☆32 · Jun 20, 2023 · Updated 3 years ago
cindyxinyiwang / multiview-subword-regularization
PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"
☆26 · Jun 2, 2021 · Updated 5 years ago
ARBML / dar
A simple semi-supervised approach for creating Hugging Face data script loaders and uploading them to the Hub.
☆11 · Jun 23, 2024 · Updated 2 years ago
inboxpraveen / Context-Search-Engine
Context Search Engine is an AI-powered semantic document search platform built for learning, experimentation, and real-world prototyping.…
☆14 · Dec 24, 2025 · Updated 7 months ago
masakhane-io / masakhanePreprocessor
Building an effective preprocessing tool for African languages
☆13 · Jan 24, 2024 · Updated 2 years ago
abdouaziz / wolof
Wolof is a library that you can use to do specific tasks in NLP with the Wolof language, e.g. text classification in Wolof, NMT, ASR
☆32 · Nov 28, 2023 · Updated 2 years ago
csikasote / BembaSpeech
This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…
☆41 · Jul 31, 2025 · Updated 11 months ago
microsoft / MetaXL
Meta Representation Transformation for Low-resource Cross-lingual Learning
☆41 · May 5, 2021 · Updated 5 years ago
dsridhar91 / hstm
Code and data for "Heterogeneous Supervised Topic Models"
☆10 · Jun 27, 2022 · Updated 4 years ago
google-research / url-nlp
☆273 · Aug 1, 2025 · Updated 11 months ago
DAMO-NLP-SG / AdamergeX
☆11 · Apr 2, 2024 · Updated 2 years ago
lgessler / microbert
A tiny BERT for low-resource monolingual models
☆32 · Dec 24, 2025 · Updated 7 months ago
kayoyin / Prodigy
CSE201 Object-Oriented Programming in C++: Teach an AI to produce pieces of music
☆12 · Jan 23, 2019 · Updated 7 years ago