ijdutse / hausa-corpus
A collection of textual datasets in the Hausa language with corresponding English translations.
☆14 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for hausa-corpus
- Crosslingual Question Answering for African Languages ☆29 · Updated last month
- MAFAND-MT ☆54 · Updated 4 months ago
- MasakhaNEWS: News Topic Classification for African Languages ☆18 · Updated 6 months ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆31 · Updated 10 months ago
- Almost state of art text generation library ☆66 · Updated 3 weeks ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆66 · Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆46 · Updated 10 months ago
- Hugging Face and Pyserini interoperability ☆19 · Updated last year
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 · Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 · Updated 2 years ago
- Tool to take your ML model from local to production with one-line of code. ☆23 · Updated 10 months ago
- Meme search engine built with Jina neural search framework. Search with captions or image files to find matching memes. ☆23 · Updated 2 years ago
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English ☆14 · Updated 4 years ago
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati… ☆33 · Updated last year
- Documentation effort for the BookCorpus dataset ☆33 · Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆18 · Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 · Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆40 · Updated 2 years ago
- A tiny BERT for low-resource monolingual models ☆29 · Updated last month
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Updated 2 years ago
- Training a model without a dataset for natural language inference (NLI) ☆25 · Updated 4 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X. ☆12 · Updated 5 months ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 · Updated 2 years ago
- Finite-state script normalization and processing utilities ☆38 · Updated this week
- All our community docs! Start here! Lets put Africa on the NLP Map ☆54 · Updated 7 months ago
- All my experiments with the various transformers and various transformer frameworks available ☆14 · Updated 3 years ago
- aiXplain enables python programmers to add AI functions to their software. ☆27 · Updated this week