Building an effective preprocessing tool for African languages
☆13Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for masakhanePreprocessor
Users that are interested in masakhanePreprocessor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆14Apr 26, 2024Updated last year
- MENYO-20k Corpus in "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" in MT Summit 2021☆13Jan 16, 2023Updated 3 years ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆67Apr 16, 2024Updated last year
- A utility micro-crate for using `Into` more ergonomically.☆12May 17, 2021Updated 4 years ago
- MAFAND-MT☆61Jul 9, 2024Updated last year
- AutoDoc is a mobile app for iOS and Android for medical purpose that helps you chat with an automated general practice doctor and get a q…☆10Jan 19, 2020Updated 6 years ago
- ☆13Jan 13, 2025Updated last year
- ☆12Mar 7, 2022Updated 4 years ago
- Kosmos technical report figures, validation code, and reproducible analyses☆29Nov 4, 2025Updated 4 months ago
- Mobile app that provides notifications about the status of the James Webb Space Telescope☆14Aug 3, 2023Updated 2 years ago
- DeepKIN -- A deep learning toolkit for Kinyarwanda NLP.☆14Jun 4, 2025Updated 9 months ago
- Future-based USB host API for Rust☆17Jun 7, 2019Updated 6 years ago
- Visualizing Intergenerational Wealth Mobility and Racial Inequality☆10Mar 21, 2019Updated 7 years ago
- Boilerplate for bundling serverless functions with webpack locally, prior to uploading to the CMS.☆14Mar 4, 2023Updated 3 years ago
- ☆20Feb 4, 2024Updated 2 years ago
- ☆16Mar 13, 2022Updated 4 years ago
- Ghost Blank is – guess what! – a blank theme for the new publishing platform Ghost.☆31Jul 20, 2014Updated 11 years ago
- OpenStratos written in Rust.☆19Jun 18, 2023Updated 2 years ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- A LaTeX package to typeset and index linguistic gloss abbreviations☆16May 22, 2022Updated 3 years ago
- A starter Ghost theme with Twitter Bootstrap integration☆43May 26, 2017Updated 8 years ago
- The python curation library for lexibank☆21Feb 12, 2026Updated last month
- The Metadata Editor for Transparent Archiving of language document materials☆23Updated this week
- GraphOfDocs: Representing multiple documents as a single graph☆21Jun 22, 2022Updated 3 years ago
- Sample notebooks for using the Global Database of Events, Language and Tone (GDELT).☆19Nov 8, 2020Updated 5 years ago
- The NLPStatTest project☆12Mar 12, 2022Updated 4 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Conversion of the LCC outline schedules from PDF to JSON☆27Apr 2, 2020Updated 5 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Oct 27, 2022Updated 3 years ago
- bevy plugin for starting a webserver to visually edit bevy resources☆22Jan 21, 2021Updated 5 years ago
- ⚙️ Das Backend zu OffeneGesetze.de☆25Jan 11, 2024Updated 2 years ago
- POS for African languages☆19Jun 25, 2025Updated 8 months ago
- Code to create the dataset from "A New Aligned Simple German Corpus☆12Jan 8, 2024Updated 2 years ago
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022)☆14Jul 22, 2022Updated 3 years ago
- Runbooks for FOLIO installation☆21Feb 5, 2026Updated last month
- MasakhaNEWS: News Topic Classification for African Languages☆25May 12, 2024Updated last year
- Code supporting the paper Graph-Embedding Empowered Entity Retrieval☆24Apr 11, 2025Updated 11 months ago
- ☆18Feb 1, 2023Updated 3 years ago
- ☆118Oct 15, 2025Updated 5 months ago