cardiffnlp / xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆152Updated 2 years ago
Alternatives and similar repositories for xlm-t:
Users that are interested in xlm-t are comparing it to the libraries listed below
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated last year
- Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"☆82Updated last year
- Dataset for Emotion Recognition Research☆211Updated 2 years ago
- Datasets for Hate Speech Detection☆126Updated last year
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆82Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 5 months ago
- Efficient Attention for Long Sequence Processing☆93Updated last year
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆71Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆106Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆66Updated 3 years ago
- Repository for TweetEval☆372Updated 2 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆28Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆202Updated 2 years ago
- XED multilingual emotion datasets☆58Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆204Updated last year
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆201Updated last year
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆75Updated last year
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- Testing and training detection models for emoji-based hate speech.☆24Updated 2 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆387Updated last year
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Updated 3 weeks ago
- Multi-dataset stance detection and robustness experiments☆44Updated last year
- This is a simple Python package for calculating a variety of lexical diversity indices☆75Updated last year
- A python package for text preprocessing task in natural language processing.☆63Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago