hate-alert / HateXplain
Can we use explanations to improve hate speech models? Our paper, accepted at AAAI 2021, explores that question.
☆226 Updated 2 years ago
Alternatives and similar repositories for HateXplain
Users who are interested in HateXplain are comparing it to the repositories listed below.
- Catalog of abusive language data (PLoS 2020) ☆320 Updated last year
- A reading list of up-to-date papers on NLP for Social Good. ☆304 Updated 2 years ago
- Repository for TweetEval ☆389 Updated 3 years ago
- A list of publications on NLP interpretability (Welcome PR) ☆168 Updated 4 years ago
- Detect toxic spans in toxic texts ☆71 Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆110 Updated 2 years ago
- ☆42 Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆93 Updated 4 months ago
- NAACL 2019 (Oral): Code for "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings" ☆41 Updated 6 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models ☆194 Updated 3 years ago
- Datasets for Hate Speech Detection ☆134 Updated 2 years ago
- Papers on fairness in NLP ☆451 Updated last year
- A multilingual lexicon of words to hurt. ☆92 Updated last month
- Awesome Neural Adaptation in Natural Language Processing. A curated list. https://arxiv.org/abs/2006.00632 ☆265 Updated 4 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation" ☆35 Updated 3 years ago
- ☆55 Updated 3 years ago
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models" ☆200 Updated 5 years ago
- ☆234 Updated 8 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in … ☆30 Updated 4 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level. ☆175 Updated 5 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts) ☆20 Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT) ☆137 Updated last year
- Dataset + classifier tools to study social perception biases in natural language generation ☆70 Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆95 Updated last year
- PyTorch Implementation of GoEmotions 😍😢😱 ☆161 Updated 2 years ago
- ☆60 Updated 4 years ago
- Data and code for the paper COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic. ☆38 Updated 9 months ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021). ☆45 Updated 6 months ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text ☆71 Updated 3 years ago
- Research framework for low resource text classification that allows the user to experiment with classification models and active learning… ☆101 Updated 3 years ago