AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
☆39Feb 5, 2026Updated 3 months ago
Alternatives and similar repositories for afrolid
Users that are interested in afrolid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated last month
- COMET for African languages☆11Jan 24, 2025Updated last year
- Bayesian Assessment of Hypotheses☆26Jul 6, 2023Updated 2 years ago
- Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"☆15Oct 4, 2024Updated last year
- Neural Machine Translation for South African Languages☆40Dec 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🔢 Work with static vector models☆39Apr 21, 2025Updated last year
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 8 months ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated 2 months ago
- Alternative robots parser module for Python☆22Apr 8, 2026Updated last month
- ☆12Mar 7, 2022Updated 4 years ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- ☆12Jan 2, 2024Updated 2 years ago
- ☆10May 11, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NTREX -- News Test References for MT Evaluation☆87Jun 5, 2024Updated last year
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- LLM-only topic extraction and classification☆11Sep 20, 2024Updated last year
- A fast python implementation of the SimHash algorithm.☆27Oct 27, 2021Updated 4 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- Data Collection System For NLP/Speech Recognition☆25Apr 20, 2021Updated 5 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆76Apr 1, 2025Updated last year
- Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal.☆14Apr 15, 2026Updated last month
- [LREC 2024] 🖋 Resource and Tool for Writing System Identification☆21Mar 29, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Hieroglyphs Everywhere fonts☆25Nov 28, 2021Updated 4 years ago
- Source stories from the African Storybook Project in Markdown format☆22Jan 25, 2026Updated 4 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 3 months ago
- Morpha lex stemmer converted using jflex.☆24Oct 12, 2020Updated 5 years ago
- A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.☆13Oct 10, 2023Updated 2 years ago
- Benchmark Arabic text diacritization dataset☆79Apr 7, 2026Updated last month
- A Directory of Online Newspaper Sources for 70+ Languages☆31Apr 15, 2021Updated 5 years ago
- ☆14Jun 25, 2024Updated last year
- A library for data streaming and augmentation☆21May 5, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Experimenting with Hierarchical Attention Networks from https://arxiv.org/abs/1606.02393 in Keras☆13Oct 12, 2016Updated 9 years ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆19Mar 23, 2024Updated 2 years ago
- ☆21Jul 22, 2022Updated 3 years ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- scipts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Jul 23, 2020Updated 5 years ago
- A reordering tool for machine translation.☆15May 3, 2019Updated 7 years ago