TManzini / DebiasMulticlassWordEmbeddingLinks
NAACL 2019 (Oral): Code for "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings"
☆41Updated 6 years ago
Alternatives and similar repositories for DebiasMulticlassWordEmbedding
Users that are interested in DebiasMulticlassWordEmbedding are comparing it to the libraries listed below
Sorting:
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆56Updated 4 years ago
- Code for the paper "Measuring Bias in Contextualized Word Representations"☆35Updated 6 years ago
- How Contextual are Contextualized Word Representations?☆42Updated 5 years ago
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆200Updated 5 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆35Updated 3 years ago
- ☆42Updated 2 years ago
- ☆40Updated 6 years ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆61Updated 3 years ago
- [ACL 2020] Towards Debiasing Sentence Representations☆66Updated 3 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆93Updated 4 months ago
- To analyze and remove gender bias in coreference resolution systems☆78Updated 7 months ago
- A Diagnostic Study of Explainability Techniques for Text Classification☆69Updated 5 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆226Updated 2 years ago
- Dataset + classifier tools to study social perception biases in natural language generation☆70Updated 2 years ago
- ☆88Updated 4 years ago
- Code for "Dynamic Contextualized Word Embeddings"☆32Updated 3 years ago
- [ACL 2020] Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation☆27Updated 5 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.☆49Updated 4 years ago
- A list of publications on NLP interpretability (Welcome PR)☆168Updated 4 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆30Updated 4 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆95Updated last year
- ☆81Updated 4 years ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆45Updated 6 months ago
- ☆90Updated 3 years ago
- Use WEAT statistic to compare bias among word embeddings trained with different algorithms, from different sources, or after debiasing☆13Updated 6 years ago
- ☆97Updated 3 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆99Updated 3 years ago
- ☆39Updated 4 years ago
- Learning Gender-Neutral Word Embeddings☆47Updated 6 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆194Updated 3 years ago