Andrews2017 / KINNEWS-and-KIRNEWS-Corpus
Data, embeddings, stopword lists, code, and baselines for the COLING 2020 paper "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
☆13 Updated last year
Alternatives and similar repositories for KINNEWS-and-KIRNEWS-Corpus
Users that are interested in KINNEWS-and-KIRNEWS-Corpus are comparing it to the libraries listed below
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆74 Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆105 Updated last year
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Building an effective preprocessing tool for African languages ☆13 Updated last year
- Generate reports for spaCy models. ☆29 Updated 3 years ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H… ☆32 Updated last year
- ☆110 Updated last year
- Crosslingual Question Answering for African Languages ☆30 Updated 8 months ago
- MasakhaNEWS: News Topic Classification for African Languages ☆23 Updated last year
- Masakhane Web is a translation web application for solely African Languages. ☆37 Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆31 Updated 2 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆155 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- 🧪 Cutting-edge experimental spaCy components and features ☆99 Updated last year
- Just another sentiment wrapper. ☆17 Updated 3 years ago
- Bag of, not words, but tricks! ☆68 Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa. ☆39 Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated last year
- COMET for African languages ☆10 Updated 4 months ago
- Dataframe Integration with spaCy. ☆103 Updated 4 years ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub ☆44 Updated last year
- 🌸 Train floret vectors ☆18 Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 Updated last month
- NeatText a simple NLP package for cleaning textual data and text preprocessing ☆72 Updated last year
- Easy PDF to text to spaCy text extraction in Python. ☆39 Updated 8 months ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently. ☆36 Updated 2 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆22 Updated 3 years ago
- Execute arbitrary SQL queries on 🤗 Datasets ☆32 Updated last year
- ☆30 Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated 2 years ago