cardiffnlp / xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆156 Updated 2 years ago
Alternatives and similar repositories for xlm-t
Users who are interested in xlm-t are comparing it to the repositories listed below
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆109 Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum… ☆262 Updated 6 months ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆203 Updated 2 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch … ☆82 Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 Updated 2 years ago
- Creating class-based TF-IDF matrices ☆84 Updated 2 years ago
- SImple SenTence EmbeddeR ☆74 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitter ☆107 Updated last year
- ☆87 Updated 3 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences ☆63 Updated last year
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also pred… ☆70 Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms. ☆72 Updated 2 years ago
- https://arxiv.org/pdf/1909.04054 ☆78 Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆91 Updated 2 months ago
- A module to compute textual lexical richness (aka lexical diversity). ☆108 Updated last year
- Efficient Attention for Long Sequence Processing ☆93 Updated last year
- Repository for TweetEval ☆376 Updated 2 years ago
- ☆161 Updated 11 months ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in … ☆29 Updated 3 years ago
- ☆59 Updated 2 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters" ☆201 Updated last year
- Datasets for Hate Speech Detection ☆128 Updated 2 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio… ☆44 Updated last year
- Lexical Simplification with Pretrained Encoders ☆70 Updated 4 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆83 Updated last year
- Dataset for Emotion Recognition Research ☆211 Updated 2 years ago
- ☆345 Updated 3 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ☆207 Updated last year
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure* ☆76 Updated last year