Andrews2017 / KINNEWS-and-KIRNEWS-Corpus
Data, embeddings, stopword lists, code, and baselines for the COLING 2020 paper "KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi" by Rubungo Andre Niyongabo, Hong Qu, Julia Kreutzer, and Li Huang.
☆12Updated 6 months ago
Related projects
Alternatives and complementary repositories for KINNEWS-and-KIRNEWS-Corpus
- Masakhane Web is a translation web application solely for African languages.☆36Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆66Updated 2 years ago
- AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.☆28Updated last year
- spaCy match and replace, maintaining conjugation☆34Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆92Updated 6 months ago
- Crosslingual Question Answering for African Languages☆29Updated last month
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆98Updated last year
- This is a repository for NaijaSenti. A Lacuna Funded Project for the development of sentiment corpus for four Nigerian languages: Igbo, H…☆31Updated 10 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆85Updated last month
- Code for extracting parallel corpora from PMIndia☆16Updated 4 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆95Updated 6 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆62Updated 8 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African Languages: https://afrisenti-semeval.github.io/☆46Updated 10 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆43Updated 5 months ago
- Generate reports for spaCy models.☆28Updated 2 years ago
- German small and large versions of GPT2.☆20Updated 2 years ago
- Rust-based Python wrapper for the Haskell duckling library☆24Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆69Updated last year
- The Kinyarwanda model for DeepSpeech☆15Updated 3 years ago
- 🌸 Train floret vectors☆18Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆18Updated 6 months ago
- Scripts for working with open.bible data☆23Updated 2 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Language detection using spaCy and fastText☆54Updated 11 months ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆37Updated last year