hate-alert / HateXplain
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
☆199Updated last year
Alternatives and similar repositories for HateXplain:
Users that are interested in HateXplain are comparing it to the libraries listed below
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated last year
- Datasets for Hate Speech Detection☆125Updated last year
- Catalog of abusive language data (PLoS 2020)☆309Updated 9 months ago
- Detect toxic spans in toxic texts☆68Updated last year
- Cross-lingual version of WEAT☆9Updated 5 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆71Updated 2 years ago
- NAACL 2019 (Oral): Code for "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings"☆40Updated 5 years ago
- ☆54Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆35Updated 3 years ago
- ☆38Updated last year
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆132Updated last year
- StereoSet: Measuring stereotypical bias in pretrained language models☆180Updated 2 years ago
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆199Updated 4 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆28Updated 3 years ago
- SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection☆74Updated 11 months ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆88Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆301Updated last year
- Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021☆37Updated 3 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated 2 years ago
- Repository for TweetEval☆367Updated 2 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆135Updated 3 months ago
- ☆232Updated 8 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆390Updated 9 months ago
- This is official code for the NAACL 2021 paper: "MelBERT: Metaphor Detection via Contextualized Late Interaction usingMetaphorical Identi…☆47Updated 2 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code☆11Updated 3 years ago
- ☆68Updated 3 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆88Updated 3 years ago
- ☆67Updated 4 years ago