cardiffnlp / xlm-tLinks
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
☆159Updated 2 years ago
Alternatives and similar repositories for xlm-t
Users that are interested in xlm-t are comparing it to the libraries listed below
Sorting:
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆200Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆96Updated 8 months ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆110Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆206Updated 3 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆82Updated 3 years ago
- ☆88Updated 4 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆100Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- [DEPRECATED] Adapt Transformer-based language models to new text domains☆86Updated last year
- Detect toxic spans in toxic texts☆71Updated 2 years ago
- Creating class-based TF-IDF matrices☆90Updated 3 years ago
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also pred…☆71Updated 3 years ago
- Datasets for Hate Speech Detection☆134Updated 2 years ago
- Repository for Vajjala & Lucic (2018)☆67Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 3 years ago
- ☆75Updated 4 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitter☆111Updated last year
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆137Updated last year
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences☆63Updated last year
- https://arxiv.org/pdf/1909.04054☆79Updated 3 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated 2 years ago
- ☆105Updated 4 years ago
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆98Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- Repository for TweetEval☆389Updated 3 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆93Updated 4 months ago
- ☆60Updated 2 years ago