BrendanKennedy / contextualizing-hate-speech-models-with-explanations
Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"
☆34Updated 3 years ago
Alternatives and similar repositories for contextualizing-hate-speech-models-with-explanations:
Users that are interested in contextualizing-hate-speech-models-with-explanations are comparing it to the libraries listed below
- Dataset + classifier tools to study social perception biases in natural language generation☆66Updated last year
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding " by Dongyeop Kang and Eduard Hovy☆13Updated 3 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆20Updated last year
- [ACL 2020] Towards Debiasing Sentence Representations☆64Updated 2 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- ☆38Updated last year
- NILE : Natural Language Inference with Faithful Natural Language Explanations☆30Updated last year
- ☆19Updated 2 years ago
- ☆71Updated 3 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆36Updated 2 years ago
- Implementation for https://arxiv.org/abs/2005.00652☆28Updated 2 years ago
- Perspectrum: a dataset of claims, perspectives and evidence documents☆33Updated 5 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.☆20Updated 3 years ago
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆57Updated last year
- How Contextual are Contextualized Word Representations?☆41Updated 4 years ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆60Updated 2 years ago
- ☆61Updated 2 years ago
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.☆30Updated 4 years ago
- ☆38Updated 3 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆27Updated 3 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆87Updated 3 years ago
- ☆43Updated 5 years ago
- ☆38Updated last year
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Updated last year
- ☆58Updated 2 years ago
- This is a repo for the EMNLP 19 Paper on gender bias in gendered languages.☆23Updated 5 years ago
- Data and code repository of " Multilingual Fairness Evaluation for Hate Speech Detection ". LREC 2020.☆20Updated 2 years ago
- Official code for the paper "PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models".☆16Updated 2 years ago
- ☆64Updated 4 years ago