hate-alert / HateXplain
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
☆194Updated last year
Alternatives and similar repositories for HateXplain:
Users that are interested in HateXplain are comparing it to the libraries listed below
- This repository contains a dataset for hate speech detection on social media platforms.☆70Updated 2 years ago
- Datasets for Hate Speech Detection☆124Updated last year
- Catalog of abusive language data (PLoS 2020)☆308Updated 8 months ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆108Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆296Updated last year
- NAACL 2019 (Oral): Code for "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings"☆39Updated 5 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆174Updated 2 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆168Updated 4 years ago
- A multilingual lexicon of words to hurt.☆83Updated 3 months ago
- Cross-lingual version of WEAT☆9Updated 5 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆88Updated last year
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆198Updated 4 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆27Updated 3 years ago
- Detect toxic spans in toxic texts☆68Updated last year
- ☆38Updated last year
- A list of publications on NLP interpretability (Welcome PR)☆167Updated 4 years ago
- ☆53Updated 2 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆34Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data☆57Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Papers on fairness in NLP☆436Updated 9 months ago
- Repository for TweetEval☆363Updated 2 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆205Updated 3 years ago
- ☆230Updated 8 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆80Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆80Updated 10 months ago
- ☆38Updated 5 years ago
- ☆68Updated 3 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆96Updated 2 years ago