Building an effective preprocessing tool for African languages
☆12Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for masakhanePreprocessor
Users that are interested in masakhanePreprocessor are comparing it to the libraries listed below.
- MENYO-20k Corpus in "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" in MT Summit 2021☆13Jan 16, 2023Updated 3 years ago
- All our community docs! Start here! Let's put Africa on the NLP Map☆66Apr 16, 2024Updated last year
- A utility micro-crate for using `Into` more ergonomically.☆12May 17, 2021Updated 4 years ago
- MAFAND-MT☆62Jul 9, 2024Updated last year
- ☆12Jan 13, 2025Updated last year
- ☆11Mar 7, 2022Updated 4 years ago
- Kosmos technical report figures, validation code, and reproducible analyses☆28Nov 4, 2025Updated 5 months ago
- DeepKIN -- A deep learning toolkit for Kinyarwanda NLP.☆13Jun 4, 2025Updated 10 months ago
- Mobile app that provides notifications about the status of the James Webb Space Telescope☆14Aug 3, 2023Updated 2 years ago
- Future-based USB host API for Rust☆17Jun 7, 2019Updated 6 years ago
- Visualizing Intergenerational Wealth Mobility and Racial Inequality☆10Mar 21, 2019Updated 7 years ago
- Boilerplate for bundling serverless functions with webpack locally, prior to uploading to the CMS.☆14Mar 4, 2023Updated 3 years ago
- ☆19Feb 4, 2024Updated 2 years ago
- ☆16Mar 13, 2022Updated 4 years ago
- Ghost Blank is – guess what! – a blank theme for the new publishing platform Ghost.☆31Jul 20, 2014Updated 11 years ago
- OpenStratos written in Rust.☆18Jun 18, 2023Updated 2 years ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- A LaTeX package to typeset and index linguistic gloss abbreviations☆16May 22, 2022Updated 3 years ago
- A starter Ghost theme with Twitter Bootstrap integration☆43May 26, 2017Updated 8 years ago
- The python curation library for lexibank☆21Feb 12, 2026Updated 2 months ago
- GraphOfDocs: Representing multiple documents as a single graph☆21Jun 22, 2022Updated 3 years ago
- The Metadata Editor for Transparent Archiving of language document materials☆24Mar 22, 2026Updated 3 weeks ago
- Sample notebooks for using the Global Database of Events, Language and Tone (GDELT).☆19Nov 8, 2020Updated 5 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Conversion of the LCC outline schedules from PDF to JSON☆27Apr 2, 2020Updated 6 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Oct 27, 2022Updated 3 years ago
- bevy plugin for starting a webserver to visually edit bevy resources☆22Jan 21, 2021Updated 5 years ago
- ⚙️ The backend for OffeneGesetze.de☆25Jan 11, 2024Updated 2 years ago
- POS for African languages☆19Jun 25, 2025Updated 9 months ago
- Code to create the dataset from "A New Aligned Simple German Corpus"☆12Jan 8, 2024Updated 2 years ago
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Jul 22, 2022Updated 3 years ago
- Runbooks for FOLIO installation☆21Feb 5, 2026Updated 2 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆26May 12, 2024Updated last year
- ☆118Oct 15, 2025Updated 5 months ago
- Code supporting the paper Graph-Embedding Empowered Entity Retrieval☆24Apr 11, 2025Updated last year
- ☆18Feb 1, 2023Updated 3 years ago
- ☆19Nov 14, 2022Updated 3 years ago
- Klexikon: A German Dataset for Joint Summarization and Simplification☆17Oct 5, 2022Updated 3 years ago