dsfsi / masakhane-webLinks
Masakhane Web is a translation web application for solely African Languages.
☆37Updated 2 years ago
Alternatives and similar repositories for masakhane-web
Users that are interested in masakhane-web are comparing it to the libraries listed below
Sorting:
- ☆115Updated last month
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- Crosslingual Question Answering for African Languages☆31Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 3 years ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆77Updated 3 years ago
- Machine Translation for Africa☆298Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆34Updated 8 months ago
- ☆21Updated 3 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆34Updated last month
- COMET for African languages☆10Updated 9 months ago
- Agile reading group that works☆13Updated 3 years ago
- MAFAND-MT☆59Updated last year
- A guide to building language technology in new languages.☆59Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Updated 4 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆52Updated last month
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- A tiny BERT for low-resource monolingual models☆31Updated last month
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- ☆45Updated 3 years ago
- Some notebooks for NLP☆207Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆34Updated 2 months ago
- OpusFilter - Parallel corpus processing toolkit☆112Updated last week
- A tool that locates, downloads, and extracts machine translation corpora☆159Updated 2 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated last year
- ☆78Updated 3 months ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year