dadelani / menyo-20k_MT
☆11Updated 3 years ago
Related projects: ⓘ
- Agile reading group that works☆13Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆27Updated last year
- Statistics on multilingual datasets☆17Updated 2 years ago
- ☆17Updated last year
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- ☆23Updated 4 years ago
- ☆20Updated 2 years ago
- This repository hosts the code for a tokenizer of tweets.☆12Updated 5 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆38Updated last year
- A web interface to understand language-specific BERT-models☆17Updated 5 months ago
- ☆19Updated 2 years ago
- ☆22Updated 2 years ago
- Multilingual Open Text☆25Updated 5 months ago
- MasakhaNEWS: News Topic Classification for African Languages☆16Updated 4 months ago
- Training a model without a dataset for natural language inference (NLI)☆25Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated last year
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆25Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆31Updated 3 months ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆18Updated 2 years ago
- Dataset of ML and NLP papers☆35Updated 2 years ago
- Analysis of gutenberg dataset☆40Updated 5 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Updated 2 years ago
- ☆12Updated 4 years ago
- CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed dat…☆30Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆45Updated 8 months ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆16Updated 2 years ago