SunbirdAI / salt-data-archive
Multi-way parallel text corpus of 5 key Ugandan languages.
☆17 · Updated last year
Alternatives and similar repositories for salt-data-archive
Users who are interested in salt-data-archive are comparing it to the repositories listed below.
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆40 · Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆113 · Updated last year
- Machine Translation for Africa ☆308 · Updated 3 years ago
- ☆117 · Updated 3 months ago
- Masakhane Web is a translation web application solely for African languages. ☆37 · Updated 2 years ago
- All our community docs! Start here! Let's put Africa on the NLP map ☆67 · Updated last year
- Repository containing various Malayalam ASR-based resources curated from multiple sources ☆18 · Updated 4 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks ☆82 · Updated 2 years ago
- A large-scale Sanskrit-English translation dataset ☆76 · Updated 2 years ago
- ☆22 · Updated 3 years ago
- An easy-to-use Python package for deep learning-based German sentiment classification. ☆58 · Updated 3 years ago
- Facebook Low Resource (FLoRes) MT Benchmark ☆762 · Updated 2 years ago
- This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/… ☆36 · Updated 6 months ago
- How good is BERT? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset ☆157 · Updated 2 years ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆205 · Updated 5 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆54 · Updated 2 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆80 · Updated 3 years ago
- AfroLID, a powerful neural toolkit for African language identification which covers 517 African languages. ☆35 · Updated this week
- A French sequence-to-sequence pretrained model ☆63 · Updated 3 years ago
- ☆45 · Updated 3 years ago
- Low-Resource Neural Machine Translation: A Benchmark for Five African Languages ☆16 · Updated 5 years ago
- A guide to building language technology in new languages. ☆59 · Updated 4 years ago
- Curated list of publicly available parallel corpora for Indian languages ☆36 · Updated 4 years ago
- A Python true casing utility that restores case information for texts ☆88 · Updated 3 years ago
- Punctuation restoration and spell correction experiments. ☆252 · Updated 4 years ago
- Some notebooks for NLP ☆207 · Updated 2 years ago
- Catalog of abusive language data (PLoS 2020) ☆321 · Updated last year
- Morphological processing for languages of the Horn of Africa ☆54 · Updated last month
- Fair Embedding Engine ☆13 · Updated 5 years ago
- Bicleaner fork that uses neural networks ☆40 · Updated last week