Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
☆59Oct 14, 2025Updated 8 months ago
Alternatives and similar repositories for hatecheck-data
Users who are interested in hatecheck-data are comparing it to the libraries listed below.
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆44May 26, 2025Updated last year
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆34Dec 13, 2021Updated 4 years ago
- Code for Blodgett et al. 2016, Demographic dialectal variation in social media☆26Nov 9, 2019Updated 6 years ago
- ☆236Dec 27, 2016Updated 9 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆20Aug 20, 2021Updated 4 years ago
- ☆10Aug 31, 2022Updated 3 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆112Jun 12, 2023Updated 3 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆95Jul 21, 2025Updated 11 months ago
- ☆15Apr 10, 2018Updated 8 years ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.☆347Jun 17, 2024Updated 2 years ago
- Catalog of abusive language data (PLoS 2020)☆324Jun 14, 2024Updated 2 years ago
- ☆10Jul 27, 2018Updated 7 years ago
- Public repository for SemEval 2023 - Task 10 - Explainable Detection of Online Sexism (EDOS)☆26Apr 13, 2023Updated 3 years ago
- Dutch abusive language data☆11Sep 23, 2023Updated 2 years ago
- ☆45Jun 29, 2023Updated 3 years ago
- Testing and training detection models for emoji-based hate speech.☆25May 15, 2022Updated 4 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆75Dec 9, 2022Updated 3 years ago
- Python standalone tokenizer☆14Nov 12, 2015Updated 10 years ago
- Generating global explanations from local ones☆11Nov 11, 2022Updated 3 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆135Mar 1, 2024Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆91Jun 3, 2026Updated 3 weeks ago
- BERT Fine-tuning for Aspect Based Sentiment Analysis☆29Aug 2, 2022Updated 3 years ago
- A contextual approach for detecting hate speech code words☆10Jul 30, 2020Updated 5 years ago
- Datasets for Hate Speech Detection☆138May 12, 2023Updated 3 years ago
- Code to reproduce experiments from the EMNLP 2015 paper about Rumour Stance Classification with Gaussian Processes.☆37May 23, 2016Updated 10 years ago
- A multilingual lexicon of words to hurt.☆99Oct 10, 2025Updated 8 months ago
- Tokenizer for Twitter and Reddit data☆45Apr 14, 2019Updated 7 years ago
- Parallel NDJSON Reader for Python☆17Dec 4, 2019Updated 6 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment☆22Jan 18, 2023Updated 3 years ago
- python project template for personal projects! 🙋‍♀️☆11Nov 28, 2020Updated 5 years ago
- Annotated corpus of Arabic tweets which mention a violence act.☆10Jun 6, 2018Updated 8 years ago
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Dec 8, 2022Updated 3 years ago
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Sep 29, 2021Updated 4 years ago
- ☆12Sep 13, 2018Updated 7 years ago
- SmallK: very fast data clustering tools☆13Apr 3, 2019Updated 7 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity☆11Oct 21, 2020Updated 5 years ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 3 years ago