Andrews2017 / KINNEWS-and-KIRNEWS-CorpusLinks
Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
☆13Updated last year
Alternatives and similar repositories for KINNEWS-and-KIRNEWS-Corpus
Users that are interested in KINNEWS-and-KIRNEWS-Corpus are comparing it to the libraries listed below
Sorting:
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Masakhane Web is a translation web application for solely African Languages.☆37Updated 2 years ago
- ☆116Updated 2 months ago
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆35Updated 2 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆45Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆105Updated last year
- Crosslingual Question Answering for African Languages☆30Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- MAFAND-MT☆60Updated last year
- Language detection using Spacy and Fasttext☆57Updated 2 years ago
- Building an effective preprocessing tool for African languages☆13Updated last year
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated 2 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 7 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆74Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated this week
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆105Updated last year
- Unannotated Spanish 3 Billion Words Corpora☆104Updated 3 years ago
- ☆45Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆76Updated this week
- Some notebooks for NLP☆207Updated 2 years ago
- Language Models for Zalando's flair library☆61Updated 5 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 4 years ago
- Explainable Zero-Shot Topic Extraction☆65Updated last year