umanlp / RedditBias
Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"
☆23 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for RedditBias
- [ACL 2020] Towards Debiasing Sentence Representations ☆61 · Updated 2 years ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models ☆60 · Updated 2 years ago
- Code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd… ☆19 · Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019. ☆54 · Updated 3 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts) ☆20 · Updated last year
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!" ☆18 · Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 · Updated 11 months ago
- Repository for the Bias Benchmark for QA dataset. ☆87 · Updated 10 months ago
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆20 · Updated last year
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆67 · Updated 3 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so… ☆16 · Updated last year
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP". ☆86 · Updated 3 years ago
- Dataset + classifier tools to study social perception biases in natural language generation ☆67 · Updated last year
- This repo contains the dataset for the EMNLP 2022 paper "Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts… ☆19 · Updated last year
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems" ☆18 · Updated last year
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers" ☆17 · Updated 3 years ago
- Implementation for https://arxiv.org/abs/2005.00652 ☆27 · Updated last year
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation" ☆33 · Updated 2 years ago
- Code for TACL 2020 paper "An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models" ☆14 · Updated 4 years ago