bvidgen / Dynamically-Generated-Hate-Speech-Dataset
Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).
☆40Updated 3 years ago
Related projects: ⓘ
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated 11 months ago
- ☆37Updated last year
- ☆66Updated 2 years ago
- Code for CAET5☆23Updated last year
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Updated 3 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆33Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 2 years ago
- ☆57Updated last year
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆26Updated 2 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆19Updated last year
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"☆16Updated 2 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆64Updated 3 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 2 years ago
- ☆40Updated 3 years ago
- The code repository for NAACL 2021 paper "AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization".☆35Updated 3 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆71Updated 2 years ago
- ☆37Updated last year
- ☆40Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆50Updated last year
- The Stanford Word Substitution (Swords) Benchmark☆31Updated 2 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated last year
- Using BERT for long sentence classification (more than 512 word pieces).☆17Updated 3 years ago
- FRANK: Factuality Evaluation Benchmark☆51Updated last year
- ☆25Updated 2 years ago
- The data and code for EmailSum☆54Updated 3 years ago
- ☆92Updated 2 years ago