XuhuiZhou / Toxic_Debias
code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Noah A. Smith and Yejin Choi
☆19Updated 3 years ago
Alternatives and similar repositories for Toxic_Debias:
Users that are interested in Toxic_Debias are comparing it to the libraries listed below
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding " by Dongyeop Kang and Eduard Hovy☆14Updated 3 years ago
- ☆58Updated 3 years ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated last year
- [ACL 2020] Towards Debiasing Sentence Representations☆65Updated 2 years ago
- ☆39Updated last year
- Dataset + classifier tools to study social perception biases in natural language generation☆68Updated last year
- ☆39Updated 3 years ago
- ☆71Updated 3 years ago
- Code Repo for the ACL21 paper "Common Sense Beyond English: Evaluating and Improving Multilingual LMs for Commonsense Reasoning"☆22Updated 3 years ago
- ☆17Updated last year
- ☆25Updated 3 years ago
- ☆48Updated 2 years ago
- ☆15Updated 3 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆35Updated 3 years ago
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…☆17Updated 3 years ago
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆19Updated last year
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆20Updated last year
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"☆17Updated 4 years ago
- ☆44Updated last year
- ☆24Updated last year
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆61Updated 2 years ago
- ☆24Updated 3 years ago
- ☆20Updated 2 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Updated last year
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆36Updated 3 years ago
- Data and code for the "Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences" (Emelin et al., 2021) pap…☆58Updated 2 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆88Updated 3 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆21Updated 4 years ago