BrendanKennedy / contextualizing-hate-speech-models-with-explanations
Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"
☆33Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for contextualizing-hate-speech-models-with-explanations
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Updated 3 years ago
- Dataset + classifier tools to study social perception biases in natural language generation☆66Updated last year
- ☆38Updated last year
- Code for the paper "Measuring Bias in Contextualized Word Representations"☆36Updated 5 years ago
- ☆57Updated 2 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆19Updated last year
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding " by Dongyeop Kang and Eduard Hovy☆13Updated 3 years ago
- Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (h…☆81Updated 3 years ago
- [ACL 2020] Towards Debiasing Sentence Representations☆61Updated last year
- Repository describing example random control tasks for designing and interpreting neural probes☆31Updated 2 years ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆60Updated 2 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Symmetric evaluation set based on the FEVER (fact verification) dataset☆50Updated 3 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆33Updated 2 years ago
- ☆60Updated last year
- A unified approach to explain conditional text generation models. Pytorch. The code of paper "Local Explanation of Dialogue Response Gene…☆18Updated 2 years ago
- ☆28Updated 3 years ago
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆47Updated last year
- Diagnostic tests for linguistic capacities in language models☆66Updated 2 years ago
- Data and code repository of " Multilingual Fairness Evaluation for Hate Speech Detection ". LREC 2020.☆20Updated last year
- Data and code for the "Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences" (Emelin et al., 2021) pap…☆51Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆97Updated last year
- Code and datasets for the EMNLP 2020 paper "Calibration of Pre-trained Transformers"☆55Updated last year
- Perspectrum: a dataset of claims, perspectives and evidence documents☆32Updated 4 years ago
- [AAAI2021] Unsupervised Opinion Summarization with Content Planning☆32Updated 2 years ago
- Implementation for https://arxiv.org/abs/2005.00652☆27Updated last year
- ☆41Updated 3 years ago
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- ☆37Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆84Updated 2 years ago