ijdutse / hausa-corpus
A collection of textual datasets in the Hausa language with corresponding English translations.
☆16Updated 4 years ago
Alternatives and similar repositories for hausa-corpus
Users who are interested in hausa-corpus are comparing it to the repositories listed below:
- Almost state-of-the-art text generation library☆66Updated last week
- Crosslingual Question Answering for African Languages☆31Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆77Updated 3 years ago
- A tiny BERT for low-resource monolingual models☆31Updated last week
- COMET for African languages☆10Updated 8 months ago
- ☆57Updated 3 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- MAFAND-MT☆59Updated last year
- Shoonya - Platform to Annotate and label data at scale.☆57Updated last year
- Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and a…☆13Updated 3 years ago
- Finite-state script normalization and processing utilities☆43Updated 2 weeks ago
- An example of multilingual machine translation using a pretrained version of mt5 from Hugging Face.☆42Updated 4 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 4 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆33Updated last month
- Masakhane Web is a translation web application solely for African languages.☆37Updated 2 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali)☆42Updated 2 years ago
- Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper☆17Updated 2 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks☆81Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 4 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆31Updated 4 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Building an effective preprocessing tool for African languages☆13Updated last year
- ☆33Updated 6 years ago
- ☆10Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/☆49Updated last year
- fastlangid, the only language identification package that supports Cantonese (zh-yue), simplified (zh-hans) and traditional Chinese (zh-ha…☆41Updated 2 years ago
- Web App Capable of Predicting Next Word Using BERT☆14Updated 2 years ago