Niger-Volta-LTI / yoruba-adr
Automatic Diacritic Restoration of Yorùbá language Text
☆24Updated 9 months ago
Alternatives and similar repositories for yoruba-adr
Users that are interested in yoruba-adr are comparing it to the libraries listed below
Sorting:
- Yorùbá language training text for NLP, ASR and TTS tasks☆76Updated 2 years ago
- Unsupervised Neural Machine Translation from West African Pidgin (Creole) to English without a single parallel sentence☆78Updated 4 years ago
- Ìrànlọ́wọ́ is a utility library for analysis & (pre)processing of Yorùbá text → https://pypi.org/project/iranlowo☆19Updated 2 years ago
- 📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language.☆22Updated 4 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆105Updated last year
- A Simple Flask App to interact with your Machine Translation Model☆12Updated 5 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆75Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆73Updated 2 years ago
- All our community docs! Start here! Lets put Africa on the NLP Map☆60Updated last year
- Machine Translation for Africa☆288Updated 2 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- ☆42Updated 3 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 3 months ago
- Fast and accurate spell correction library☆81Updated 3 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆54Updated 4 years ago
- ☆12Updated 3 years ago
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Updated 6 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 3 years ago
- List of research and engineering of NLP for American Native/Indigenous Languages.☆92Updated 4 years ago
- Almost state of art text generation library☆66Updated last week
- ☆110Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 7 months ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- Efficient Low-Memory Aligner☆143Updated 4 months ago