UBC-NLP / serengeti
SERENGETI: Massively Multilingual Language Models for Africa
☆14Updated last year
Related projects
Alternatives and complementary repositories for serengeti
- AfroLID, a powerful neural toolkit for African language identification that covers 517 African languages.☆28Updated last year
- Measuring if attention is explanation with ROAR☆22Updated last year
- An implementation of GrASP (Shnarch et al., 2017)☆21Updated 2 years ago
- Arabic News Stance Corpus☆10Updated 3 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆18Updated 2 years ago
- ☆30Updated 4 years ago
- Religious Hate Speech Detection for Arabic Tweets☆24Updated 5 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆25Updated 7 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Introduction to Topic Modeling for TextXD 2019, 12/3/2019☆10Updated 4 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆91Updated 6 months ago
- Python tools for text☆15Updated 4 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated 11 months ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- ParaNames: A multilingual resource for parallel names☆30Updated 5 months ago
- ☆11Updated 3 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆27Updated 4 years ago
- ☆19Updated last year
- ☆24Updated 5 months ago
- Multilingual Open Text☆25Updated 2 weeks ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/☆45Updated 10 months ago
- Named Entity Recognition with a decoder-only (autoregressive) LLM using HuggingFace☆29Updated this week
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆11Updated 4 months ago
- Materials for the course Deep Learning for Natural Language Processing☆11Updated 4 months ago
- ☆12Updated 4 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆18Updated 6 months ago
- ☆17Updated last year