ijdutse / hausa-corpusLinks
A collection of textual datasets in Hausa language and the corresponding translation in English language.
☆16Updated 4 years ago
Alternatives and similar repositories for hausa-corpus
Users that are interested in hausa-corpus are comparing it to the libraries listed below
Sorting:
- Crosslingual Question Answering for African Languages☆30Updated last year
- Almost state of art text generation library☆66Updated last week
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆78Updated 3 years ago
- Masakhane Web is a translation web application for solely African Languages.☆37Updated 2 years ago
- MAFAND-MT☆59Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆41Updated 2 years ago
- ☆10Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 2 months ago
- ☆57Updated 3 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆44Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆41Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 3 years ago
- Translation demonstrator☆34Updated 5 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- ☆21Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 5 years ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- Text to Speech for Indic languages☆52Updated 3 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆81Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 3 years ago
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆43Updated 4 years ago
- Few Shot Learning using EleutherAI's GPT-Neo an Open-source version of GPT-3☆18Updated 4 years ago
- YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training☆45Updated 5 years ago
- This repo contains 3 hours of audio speech recordings in Yoruba language collected for research purposes.☆18Updated 5 years ago
- GPT-jax based on the official huggingface library☆13Updated 4 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆15Updated last year