UBC-NLP/afrolid

AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.

☆39

Alternatives and similar repositories for afrolid

Users who are interested in afrolid are comparing it to the libraries listed below.


UBC-NLP / serengeti
SERENGETI: Massively Multilingual Language Models for Africa
☆17 · Oct 26, 2023 · Updated 2 years ago
cisnlp / GlotWeb
[WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages
☆17 · Apr 14, 2026 · Updated 3 months ago
masakhane-io / africomet
COMET for African languages
☆11 · Jan 24, 2025 · Updated last year
neuml / staticvectors
🔢 Work with static vector models
☆39 · Apr 21, 2025 · Updated last year
fdschmidt93 / trident-nllb-llm2vec
Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"
☆15 · Oct 4, 2024 · Updated last year
LauraMartinus / ukuxhumana
Neural Machine Translation for South African Languages
☆40 · Dec 8, 2022 · Updated 3 years ago
mbanon / fastspell
Targeted language identifier, based on FastText and Hunspell.
☆38 · Sep 4, 2025 · Updated 10 months ago
MatthewHallberg / CoronaVisualizer
☆11 · Apr 1, 2020 · Updated 6 years ago
Tangerine-Community / tangy-form
<tangy-form> is a web component for creating multipage forms. Other <tangy-*> input elements are included as well.
☆16 · Mar 30, 2026 · Updated 3 months ago
LucyMcGowan / comps-survival-guide
A survival guide for Vanderbilt Biostatistics first year comprehensive exams
☆14 · May 12, 2020 · Updated 6 years ago
lm-pub-quiz / lm-pub-quiz
Evaluate language models using multiple choice items
☆13 · Mar 6, 2026 · Updated 4 months ago
transferwise / wise-topic
LLM-only topic extraction and classification
☆11 · Jun 3, 2026 · Updated last month
andreburgaud / robotspy
Alternative robots parser module for Python
☆22 · Jun 19, 2026 · Updated last month
muety / linkeddata-trivia
Auto-generated trivia questions based on DBPedia data.
☆15 · Feb 26, 2017 · Updated 9 years ago
aleju / CharVectorizer
Transform strings to vectors for neural networks.
☆15 · May 22, 2015 · Updated 11 years ago
UBC-NLP / palm
☆32 · Mar 21, 2026 · Updated 4 months ago
deep-spin / sparse-communication
☆12 · Mar 7, 2022 · Updated 4 years ago
kpu / fasterText
Library for fast text representation and classification.
☆31 · Jan 9, 2024 · Updated 2 years ago
mcfnlp / Dictionary
English-Myanmar dictionary data
☆15 · Aug 23, 2016 · Updated 9 years ago
MicrosoftTranslator / NTREX
NTREX -- News Test References for MT Evaluation
☆87 · Jun 5, 2024 · Updated 2 years ago
Tomiinek / Aargh
☆12 · Jan 2, 2024 · Updated 2 years ago
UBC-NLP / turjuman
TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).
☆58 · Apr 9, 2023 · Updated 3 years ago
jacobmarks / emoji_search
Semantically Search Emojis From the Command Line!
☆13 · Nov 26, 2023 · Updated 2 years ago
mmorise / tusk
A framework for overviewing the performance of F0 estimators
☆19 · Sep 10, 2016 · Updated 9 years ago
afrith / election-map-frontend
Interactive map of South African election results visualised in various ways
☆13 · Jun 14, 2024 · Updated 2 years ago
chpollin / Teaching
This repository contains my teaching material. Most of it is in German.
☆13 · Updated this week
laurieburchell / open-lid-dataset
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
☆77 · Apr 1, 2025 · Updated last year
cisnlp / GlotScript
[LREC 2024] 🖋 Resource and Tool for Writing System Identification
☆22 · Mar 29, 2026 · Updated 3 months ago
hplt-project / OpusCleaner
OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.
☆58 · Feb 3, 2026 · Updated 5 months ago
badrex / rdf2text
Generating text from RDF data with sequence to sequence models
☆11 · Jul 25, 2018 · Updated 7 years ago
karthikncode / MorphoChain
A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.
☆13 · Oct 10, 2023 · Updated 2 years ago
annakrystalli / rrresearchACCE20
Materials associated with ACCE DTP course on Reproducible Research Data & Project Management in R
☆10 · Sep 3, 2024 · Updated last year
jasonmayes / Retraining-TensorFlow-Classifier-Using-Video
Script to convert all MP4 videos in a zip archive to JPG frames at a desired FPS with unique names. It will then retrain the top layers o…
☆12 · Jul 6, 2016 · Updated 10 years ago
kadarakos / hieratt
Experimenting with Hierarchical Attention Networks from https://arxiv.org/abs/1606.02393 in Keras
☆13 · Oct 12, 2016 · Updated 9 years ago
AliOsm / arabic-text-diacritization
Benchmark Arabic text diacritization dataset
☆78 · Apr 7, 2026 · Updated 3 months ago
mixedbread-ai / binary-embeddings
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…
☆19 · Mar 23, 2024 · Updated 2 years ago
NTRLab / MediaSpeech
☆22 · Jul 22, 2022 · Updated 4 years ago
azpoliak / eco
Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)
☆15 · Apr 6, 2017 · Updated 9 years ago
cisnlp / Glot500
[ACL 2023] Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
☆107 · Apr 14, 2026 · Updated 3 months ago