ijdutse / hausa-corpus
A collection of textual datasets in the Hausa language with corresponding English translations.
☆14 · Updated 3 years ago
Alternatives and similar repositories for hausa-corpus:
Users who are interested in hausa-corpus are comparing it to the libraries listed below:
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆66 · Updated 2 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆31 · Updated last year
- Crosslingual Question Answering for African Languages ☆29 · Updated 4 months ago
- Almost state-of-the-art text generation library ☆66 · Updated 3 months ago
- Hinglish Text Classification ☆30 · Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 · Updated last year
- Hausa-NMT: Empirical Study of Neural Machine translation for English-Hausa-English ☆15 · Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 · Updated 2 years ago
- Using short models to classify long texts ☆21 · Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterances using the CST5 augmentati… ☆35 · Updated last year
- Using Machine Learning to Create Funny Memes ☆24 · Updated last year
- ☆107 · Updated last year
- All my experiments with the various transformers and various transformer frameworks available ☆14 · Updated 3 years ago
- Documentation effort for the BookCorpus dataset ☆33 · Updated 3 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆58 · Updated 2 years ago
- BERT Probe: A Python package for probing attention-based robustness to character- and word-based adversarial evaluation. Also, with recipe… ☆18 · Updated 2 years ago
- Comparing M2M and mT5 on rare language pairs, blog post: https://medium.com/@abdessalemboukil/comparing-facebooks-m2m-to-mt5-in-low-re… ☆16 · Updated 3 years ago
- Translation demonstrator ☆29 · Updated 4 years ago
- MAFAND-MT ☆55 · Updated 6 months ago
- Custom Named Entity Recognition annotated using NER Annotated by tecoholic and Spacy for training the model ☆16 · Updated 4 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆32 · Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆48 · Updated last year
- Implementation for WikiCheck API, an open-source Wikipedia-based fact-checking API. The project is done in cooperation with Wikimedia Fou… ☆22 · Updated 7 months ago
- Building Chatbots with Rasa, spaCy, Wit.ai, etc. ☆30 · Updated 6 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆18 · Updated last year
- Shoonya - Platform to Annotate and label data at scale. ☆52 · Updated 4 months ago
- The primary backend service for Atila apps. ☆40 · Updated 2 months ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects ☆21 · Updated last year
- Simple pdf to text with python using PDFtk and PyPDF2 ☆20 · Updated last year