cardiffnlp / xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆150Updated 2 years ago
Alternatives and similar repositories for xlm-t:
Users who are interested in xlm-t are comparing it to the libraries listed below
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch …☆80Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆258Updated 4 months ago
- Efficient Attention for Long Sequence Processing☆92Updated last year
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Dataset for Emotion Recognition Research☆207Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆108Updated last year
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- ☆57Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆103Updated last year
- PyTorch Implementation of GoEmotions 😍😢😱☆158Updated last year
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Repository for Vajjala & Lucic (2018)☆64Updated last year
- MobileBERT and DistilBERT for extractive summarization☆88Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- This repository contains a dataset for hate speech detection on social media platforms.☆70Updated 2 years ago
- Enhancing BERT training with Semi-supervised Generative Adversarial Networks in PyTorch/HuggingFace☆96Updated 3 years ago
- ☆86Updated 3 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆200Updated last year
- Reimplementation of a BERT-based model (Shi et al., 2019), currently the state of the art for English SRL. This model also implements pred…☆70Updated 3 years ago
- Detect toxic spans in toxic texts☆68Updated last year
- Datasets for Hate Speech Detection☆125Updated last year
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆44Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 2 years ago
- Master's thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆33Updated 3 years ago
- Clustering sentence embeddings to extract message intent☆172Updated 3 years ago
- (1) Pretrain ALBERT on a custom corpus; (2) fine-tune the pretrained ALBERT model on a downstream task☆33Updated 4 years ago
- Few-shot Named Entity Recognition☆123Updated 2 years ago