Andrews2017 / KINNEWS-and-KIRNEWS-Corpus
Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
☆12Updated last year
Alternatives and similar repositories for KINNEWS-and-KIRNEWS-Corpus:
Users that are interested in KINNEWS-and-KIRNEWS-Corpus are comparing it to the libraries listed below
- Building an effective preprocessing tool for African languages☆12Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆73Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆104Updated last year
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆32Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Masakhane Web is a translation web application for solely African Languages.☆36Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- COMET for African languages☆10Updated 3 months ago
- ☆110Updated last year
- Crosslingual Question Answering for African Languages☆29Updated 6 months ago
- ☆32Updated 6 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated 11 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- MAFAND-MT☆55Updated 9 months ago
- ☆64Updated 2 years ago
- ☆17Updated 2 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 11 months ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Hinglish Text Classification☆30Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- A guide to building language technology in new languages.☆58Updated 3 years ago
- CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed dat…☆35Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 11 months ago
- Generate large textual corpora for almost any language by crawling the web☆12Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago