ijdutse / hausa-corpus
A collection of textual datasets in Hausa language and the corresponding translation in English language.
☆15Updated 4 years ago
Alternatives and similar repositories for hausa-corpus
Users that are interested in hausa-corpus are comparing it to the libraries listed below
Sorting:
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆32Updated last year
- Crosslingual Question Answering for African Languages☆30Updated 7 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated last year
- Almost state of art text generation library☆66Updated last week
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆73Updated 2 years ago
- MAFAND-MT☆55Updated 10 months ago
- This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes.☆17Updated 4 years ago
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 3 years ago
- ☆110Updated last year
- Building an effective preprocessing tool for African languages☆12Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 7 months ago
- COMET for African languages☆10Updated 3 months ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆14Updated 8 months ago
- scipts for working with open.bible data☆24Updated 3 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Hinglish Text Classification☆30Updated last year
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Updated 3 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 3 years ago
- ☆15Updated 6 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- ☆11Updated 9 years ago
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch…☆9Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 4 years ago
- A python package for whisper normalizer☆59Updated last week
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆13Updated last year