cardiffnlp / xlm-tLinks
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆157Updated 2 years ago
Alternatives and similar repositories for xlm-t
Users that are interested in xlm-t are comparing it to the libraries listed below
Sorting:
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆108Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆263Updated 7 months ago
- A module to compute textual lexical richness (aka lexical diversity).☆109Updated last year
- Datasets for Hate Speech Detection☆130Updated 2 years ago
- ☆87Updated 3 years ago
- Creating class-based TF-IDF matrices☆84Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Dataset for Emotion Recognition Research☆212Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 9 months ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 3 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- ☆60Updated 4 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- ☆103Updated 3 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆82Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- A multilingual lexicon of words to hurt.☆89Updated this week
- This is a simple Python package for calculating a variety of lexical diversity indices☆77Updated last year
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also pred…☆70Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆53Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 3 months ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- XED multilingual emotion datasets☆61Updated 2 years ago
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆33Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- ☆346Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation☆162Updated last year
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago