bvidgen / Dynamically-Generated-Hate-Speech-Dataset
Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).
☆42 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for Dynamically-Generated-Hate-Speech-Dataset
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 · Updated 3 years ago
- Code for CAET5 ☆23 · Updated last year
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!" ☆18 · Updated last year
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 · Updated 2 years ago
- ☆67 · Updated 3 years ago
- The Stanford Word Substitution (Swords) Benchmark ☆31 · Updated 2 years ago
- ☆38 · Updated last year
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd… ☆19 · Updated 3 years ago
- ☆67 · Updated 3 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 · Updated 3 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation" ☆33 · Updated 2 years ago
- ☆26 · Updated 10 months ago
- Pretraining scripts for BART transformer model ☆11 · Updated last year
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning ☆14 · Updated last year
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence ☆35 · Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆54 · Updated 2 years ago
- Models for automatically transforming toxic text to neutral ☆33 · Updated last year
- ☆46 · Updated 4 years ago
- ☆77 · Updated 6 months ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning ☆40 · Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆51 · Updated last year
- FRANK: Factuality Evaluation Benchmark ☆52 · Updated last year
- ☆60 · Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages. ☆72 · Updated 2 years ago
- ☆57 · Updated last year
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs". ☆38 · Updated 5 months ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020. ☆48 · Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆84 · Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks ☆63 · Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆86 · Updated last year