Andrews2017 / KINNEWS-and-KIRNEWS-Corpus
Data, Embeddings, Stopword lists, code, and baselines for COLING 2020 paper titled "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
☆12Updated 10 months ago
Alternatives and similar repositories for KINNEWS-and-KIRNEWS-Corpus:
Users that are interested in KINNEWS-and-KIRNEWS-Corpus are comparing it to the libraries listed below
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆69Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆108Updated last year
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆31Updated last year
- Crosslingual Question Answering for African Languages☆29Updated 5 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆102Updated 10 months ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆48Updated last year
- Masakhane Web is a translation web application for solely African Languages.☆36Updated last year
- ☆17Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated last month
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆39Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆48Updated 3 years ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 9 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- The kinyarwanda model for deepspeech☆15Updated 3 years ago
- scipts for working with open.bible data☆24Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆19Updated 9 months ago
- MAFAND-MT☆55Updated 7 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- This is a neural spell checker☆65Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- 🚀GUI for training spaCy models☆54Updated 3 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- ☆17Updated 6 months ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Updated 10 months ago