dsfsi / masakhane-webLinks
Masakhane Web is a translation web application for solely African Languages.
☆37Updated last year
Alternatives and similar repositories for masakhane-web
Users that are interested in masakhane-web are comparing it to the libraries listed below
Sorting:
- ☆109Updated last year
- Crosslingual Question Answering for African Languages☆31Updated 9 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 2 years ago
- Machine Translation for Africa☆289Updated 3 years ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆74Updated 3 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 4 months ago
- ☆44Updated 3 years ago
- A french sequence to sequence pretrained model☆62Updated 2 years ago
- MAFAND-MT☆57Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated last year
- Some notebooks for NLP☆205Updated last year
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆32Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 9 months ago
- OpusFilter - Parallel corpus processing toolkit☆105Updated 2 weeks ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- Klexikon: A German Dataset for Joint Summarization and Simplification☆17Updated 2 years ago
- ☆18Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆51Updated last week
- COMET for African languages☆10Updated 5 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Generate large textual corpora for almost any language by crawling the web☆12Updated last year
- Agile reading group that works☆13Updated 3 years ago
- Open information and community for machine translation☆79Updated 2 weeks ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆49Updated 4 years ago