paul-rottger / hatecheck-data
Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
☆56 Updated 2 years ago
Related projects
Alternatives and complementary repositories for hatecheck-data
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆86 Updated last year
- Toxicity Detection in Context: Assuming that the comment exists in a thread and that the parent comment or/and the discussion topic are e… ☆27 Updated last year
- This repository contains papers and resources pertaining to Hate speech research. ☆43 Updated 3 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis" ☆56 Updated last year
- NAACL 2019 (Oral): Code for "Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings" ☆38 Updated 5 years ago
- Learning Gender-Neutral Word Embeddings ☆46 Updated 5 years ago
- A multilingual lexicon of words to hurt. ☆80 Updated 2 weeks ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆76 Updated 7 months ago
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop ☆43 Updated 2 years ago
- Training Temporal Word Embeddings with a Compass ☆64 Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆20 Updated last year
- Harassment Lexicon and Corpus ☆27 Updated 6 years ago
- Code for the paper "Measuring Bias in Contextualized Word Representations" ☆35 Updated 5 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in … ☆27 Updated 2 years ago
- Cross-lingual version of WEAT ☆9 Updated 5 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆107 Updated last year
- Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions" ☆21 Updated 2 years ago
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆20 Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆102 Updated 9 months ago
- An implementation of GrASP (Shnarch et al., 2017) ☆21 Updated 2 years ago