cardiffnlp / xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆158 Updated 2 years ago
Alternatives and similar repositories for xlm-t
Users who are interested in xlm-t are comparing it to the libraries listed below.
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters" ☆201 Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆109 Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools ☆144 Updated 2 years ago
- Detect toxic spans in toxic texts ☆70 Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT) ☆137 Updated last year
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆95 Updated 7 months ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆205 Updated 3 years ago
- ☆88 Updated 3 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch … ☆82 Updated 3 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆63 Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆105 Updated last year
- TimeLMs: Diachronic Language Models from Twitter ☆111 Updated last year
- This repository contains a dataset for hate speech detection on social media platforms. ☆74 Updated 2 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆99 Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ☆222 Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆92 Updated 3 months ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in … ☆29 Updated 3 years ago
- Datasets for Hate Speech Detection ☆132 Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆33 Updated 3 years ago
- ☆75 Updated 4 years ago
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre… ☆34 Updated 4 years ago
- Dataset for Emotion Recognition Research ☆216 Updated 2 years ago
- Creating class-based TF-IDF matrices ☆90 Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆120 Updated 4 years ago
- Introduction to the recently released T5 model from the paper - Exploring the Limits of Transfer Learning with a Unified Text-to-Text Tra… ☆35 Updated 5 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization ☆157 Updated 2 years ago
- A repo to explore different NLP tasks which can be solved using T5 ☆172 Updated 4 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆189 Updated 4 years ago
- Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh" ☆82 Updated 2 years ago