hate-alert / HateXplain
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
☆194Updated last year
Alternatives and similar repositories for HateXplain:
Users that are interested in HateXplain are comparing it to the libraries listed below
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆108Updated last year
- This repository contains a dataset for hate speech detection on social media platforms.☆69Updated 2 years ago
- Datasets for Hate Speech Detection☆120Updated last year
- A repo to explore different NLP tasks which can be solved using T5☆170Updated 3 years ago
- Catalog of abusive language data (PLoS 2020)☆308Updated 7 months ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆133Updated 11 months ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆169Updated 2 years ago
- ☆39Updated last year
- Detect toxic spans in toxic texts☆68Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆291Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆87Updated last year
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆198Updated 4 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆382Updated 6 months ago
- A multilingual lexicon of words to hurt.☆82Updated 2 months ago
- ☆66Updated 4 years ago
- Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021☆36Updated 2 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆33Updated 3 years ago
- A list of publications on NLP interpretability (Welcome PR)☆167Updated 4 years ago
- Pytorch Implementation of GoEmotions 😍😢😱☆153Updated last year
- NAACL 2019 (Oral): Code for "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings"☆38Updated 5 years ago
- Repository for TweetEval☆363Updated 2 years ago
- Multimodal Meme Classification: Identifying Offensive Content in Image and Text☆69Updated 2 years ago
- ☆53Updated 2 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆165Updated 4 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆80Updated 2 years ago
- ☆228Updated 8 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆132Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆78Updated 9 months ago
- Cross-lingual version of WEAT☆9Updated 5 years ago