cardiffnlp / xlm-tLinks
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆157Updated 2 years ago
Alternatives and similar repositories for xlm-t
Users that are interested in xlm-t are comparing it to the libraries listed below
Sorting:
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆203Updated 2 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆82Updated 2 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆202Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆108Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated 2 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆73Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- ☆87Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 4 months ago
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆87Updated last year
- Repository for TweetEval☆379Updated 3 years ago
- Creating class-based TF-IDF matrices☆86Updated 2 years ago
- ☆75Updated 4 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 3 years ago
- Detect toxic spans in toxic texts☆69Updated 2 years ago
- SImple SenTence EmbeddeR☆74Updated 2 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- ☆103Updated 4 years ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆271Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Datasets for Hate Speech Detection☆130Updated 2 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆29Updated 3 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆67Updated 3 years ago