castorini/afriberta

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/castorini/afriberta)

castorini / afriberta

AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages

☆83

Alternatives and similar repositories for afriberta

Users that are interested in afriberta are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hausanlp / NaijaSenti
View on GitHub
This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…
☆39Oct 14, 2025Updated 9 months ago
masakhane-io / masakhane-ner
View on GitHub
☆122Oct 15, 2025Updated 9 months ago
masakhane-io / masakhane-news
View on GitHub
MasakhaNEWS: News Topic Classification for African Languages
☆26May 12, 2024Updated 2 years ago
Andrews2017 / africanlp-public-datasets
View on GitHub
A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.
☆117Apr 26, 2024Updated 2 years ago
dadelani / sib-200
View on GitHub
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆26May 20, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Andrews2017 / KINNEWS-and-KIRNEWS-Corpus
View on GitHub
Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …
☆15Apr 26, 2024Updated 2 years ago
afrisenti-semeval / afrisent-semeval-2023
View on GitHub
AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/
☆53Jan 10, 2024Updated 2 years ago
UBC-NLP / serengeti
View on GitHub
SERENGETI: Massively Multilingual Language Models for Africa
☆17Oct 26, 2023Updated 2 years ago
bonaventuredossou / ffr-v1
View on GitHub
Towards developing a Robust Translation Model for African languages: Pilot Project FFR v1.0.
☆47May 12, 2024Updated 2 years ago
masakhane-io / masakhane-mt
View on GitHub
Machine Translation for Africa
☆322Jun 14, 2022Updated 4 years ago
cindyxinyiwang / expand-via-lexicon-based-adaptation
View on GitHub
Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"
☆29Apr 2, 2022Updated 4 years ago
masakhane-io / lafand-mt
View on GitHub
MAFAND-MT
☆63Jul 9, 2024Updated 2 years ago
formiel / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆20May 13, 2026Updated 2 months ago
bonaventuredossou / MLM_AL
View on GitHub
☆24May 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
evelynkyl / yue_nmt
View on GitHub
Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project
☆16Oct 28, 2022Updated 3 years ago
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
Mister-iks / ai_suggest_deployment
View on GitHub
AI SUGGEST is a powerful command-line assistant that leverages AI to provide accurate Linux commands based on natural language queries. S…
☆11Aug 22, 2024Updated last year
AmericasNLP / americasnlp2023
View on GitHub
☆10May 15, 2023Updated 3 years ago
facebookresearch / flores
View on GitHub
Facebook Low Resource (FLoRes) MT Benchmark
☆771Nov 20, 2023Updated 2 years ago
uds-lsv / afro-maft
View on GitHub
☆17Jan 12, 2023Updated 3 years ago
talsperre / Flask-Stackoverflow-App
View on GitHub
A feature rich Stackoverflow clone made using Flask, SQLite3 & JS
☆16Sep 7, 2017Updated 8 years ago
dadelani / africanlp-resources
View on GitHub
List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond
☆13Aug 15, 2022Updated 3 years ago
SLAB-NLP / BUG
View on GitHub
A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021
☆14Apr 3, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ehsanasgari / 1000Langs
View on GitHub
Creating super-parallel corpora of more than 1500+ unique languages for NLP research
☆33Dec 8, 2022Updated 3 years ago
masakhane-io / masakhanePreprocessor
View on GitHub
Building an effective preprocessing tool for African languages
☆13Jan 24, 2024Updated 2 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
coqui-ai / open-bible-scripts
View on GitHub
scipts for working with open.bible data
☆26Jan 24, 2022Updated 4 years ago
alvations / SeedLing
View on GitHub
Building and Using A Seed Corpus for the Human Language Project
☆11Feb 9, 2018Updated 8 years ago
GT4SD / zero-shot-bert-adapters
View on GitHub
Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.
☆44Jun 13, 2023Updated 3 years ago
asmelashteka / HornMT
View on GitHub
Machine translation (MT) benchmark dataset for languages in the Horn of Africa.
☆46Oct 13, 2022Updated 3 years ago
prithuls / MV-Swin-T
View on GitHub
☆19Feb 28, 2024Updated 2 years ago
Patil-Onkar / Remove-silence-from-an-audio
View on GitHub
☆10Jun 30, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NI57721 / vim-socrates
View on GitHub
A Vim/Neovim plugin for typing ancient Greek characters
☆13Jul 17, 2025Updated last year
Gabriel-Ducrocq / Tennis
View on GitHub
Model for predicting WTA tennis matches outcome.
☆15Feb 19, 2015Updated 11 years ago
neulab / newlang-tech
View on GitHub
A guide to building language technology in new languages.
☆59Feb 1, 2022Updated 4 years ago
Mildemelwe / Non-English-Tacotron-2-Training-Notebook
View on GitHub
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
☆11Nov 19, 2022Updated 3 years ago
Harry-Chan / seq2seqlm-on-qg
View on GitHub
☆13Feb 9, 2022Updated 4 years ago
ymoslem / MT-Tools
View on GitHub
Collection of Common Machine Translation Tools
☆11Jul 26, 2022Updated 3 years ago
szaza / dataset-generator
View on GitHub
It is a very simple program to generate Pascal VOC style learning dataset from images. It generates images and XML style annotations with…
☆11Apr 21, 2018Updated 8 years ago