AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
☆38Feb 5, 2026Updated 2 months ago
Alternatives and similar repositories for afrolid
Users that are interested in afrolid are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SERENGETI: Massively Multilingual Language Models for Africa☆17Oct 26, 2023Updated 2 years ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- COMET for African languages☆11Jan 24, 2025Updated last year
- Bayesian Assessment of Hypotheses☆26Jul 6, 2023Updated 2 years ago
- Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages"☆15Oct 4, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 🔢 Work with static vector models☆38Apr 21, 2025Updated 11 months ago
- Targetted language identifier, based on FastText and Hunspell.☆38Sep 4, 2025Updated 7 months ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated last month
- The easiest way to update static sites hosted on GitHub Pages with a visual editor☆11Mar 28, 2018Updated 8 years ago
- ☆12Mar 7, 2022Updated 4 years ago
- English-Myanmar dictionary data☆14Aug 23, 2016Updated 9 years ago
- ☆10May 11, 2024Updated last year
- NTREX -- News Test References for MT Evaluation☆87Jun 5, 2024Updated last year
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- LLM-only topic extraction and classification☆11Sep 20, 2024Updated last year
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Apr 9, 2023Updated 3 years ago
- A framework for overviewing the performance of F0 estimators☆19Sep 10, 2016Updated 9 years ago
- A fast python implementation of the SimHash algorithm.☆27Oct 27, 2021Updated 4 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- Data Collection System For NLP/Speech Recognition☆25Apr 20, 2021Updated 4 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆75Apr 1, 2025Updated last year
- Source stories from the African Storybook Project in Markdown format☆22Jan 25, 2026Updated 2 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆58Feb 3, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Generating text from RDF data with sequence to sequence models☆12Jul 25, 2018Updated 7 years ago
- A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.☆13Oct 10, 2023Updated 2 years ago
- Benchmark Arabic text diacritization dataset☆77Apr 7, 2026Updated last week
- A Directory of Online Newspaper Sources for 70+ Languages☆31Apr 15, 2021Updated 4 years ago
- ☆14Jun 25, 2024Updated last year
- Script to convert all MP4 videos in a zip archive to JPG frames at a desired FPS with unique names. It will then retrain the top layers o…☆12Jul 6, 2016Updated 9 years ago
- Experimenting with Hierarchical Attention Networks from https://arxiv.org/abs/1606.02393 in Keras☆13Oct 12, 2016Updated 9 years ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆19Mar 23, 2024Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Apr 20, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A reordering tool for machine translation.☆15May 3, 2019Updated 6 years ago
- Meedan's Open Source Arabic/English Translation Memory☆33Nov 4, 2009Updated 16 years ago
- Translation of query languages to serialized KoralQuery protocol☆14Mar 30, 2026Updated 2 weeks ago
- This is an analytical project done using python to process and extract valuable insights from WhatsApp text file, deployed as a webapp us…☆19Dec 8, 2023Updated 2 years ago
- ☆13Jul 25, 2024Updated last year
- datasets with text data for use in NLP, Text analysis, information extraction, ML research.☆16Feb 1, 2019Updated 7 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆163Apr 7, 2026Updated last week