ijdutse / hausa-corpusLinks
A collection of textual datasets in Hausa language and the corresponding translation in English language.
☆15Updated 4 years ago
Alternatives and similar repositories for hausa-corpus
Users that are interested in hausa-corpus are comparing it to the libraries listed below
Sorting:
- Crosslingual Question Answering for African Languages☆30Updated 8 months ago
- Almost state of art text generation library☆66Updated last month
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆32Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆74Updated 3 years ago
- scipts for working with open.bible data☆24Updated 3 years ago
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- Shoonya - Platform to Annotate and label data at scale.☆54Updated 9 months ago
- ☆10Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆48Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Building an effective preprocessing tool for African languages☆13Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 3 years ago
- MAFAND-MT☆55Updated 10 months ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- ☆11Updated 3 years ago
- COMET for African languages☆10Updated 4 months ago
- Finite-state script normalization and processing utilities☆40Updated 2 weeks ago
- Predicting what word comes next with Tensorflow.☆10Updated 2 years ago
- Rust bindings for CTranslate2☆14Updated last year
- Recipes to prepare datasets!☆14Updated 2 months ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 4 months ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆76Updated 2 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Updated 5 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated 2 years ago
- Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text …☆13Updated last year
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 4 years ago