dadelani / menyo-20k_MT
☆11Updated 3 years ago
Alternatives and similar repositories for menyo-20k_MT:
Users that are interested in menyo-20k_MT are comparing it to the libraries listed below
- ☆22Updated 3 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- A web interface to understand language-specific BERT-models☆17Updated 11 months ago
- Agile reading group that works☆13Updated 3 years ago
- ☆19Updated 2 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 9 months ago
- ☆17Updated 6 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆17Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28Updated 2 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆23Updated 10 months ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆16Updated 3 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆48Updated last year
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- ☆24Updated 5 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆19Updated 2 years ago
- ☆10Updated 6 years ago
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago
- Arabic News Stance Corpus☆10Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆42Updated 3 years ago
- ☆17Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated 2 weeks ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 3 years ago
- EMNLP Findings 2020: Reevaluating Adversarial Examples in Natural Language☆7Updated 4 years ago
- ☆15Updated 4 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- ☆12Updated 4 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year