Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
☆59Oct 14, 2025Updated 6 months ago
Alternatives and similar repositories for hatecheck-data
Users that are interested in hatecheck-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆46May 26, 2025Updated 11 months ago
- ☆235Dec 27, 2016Updated 9 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆20Aug 20, 2021Updated 4 years ago
- ☆10Aug 31, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code☆11May 18, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆95Jul 21, 2025Updated 9 months ago
- ☆15Apr 10, 2018Updated 8 years ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.☆345Jun 17, 2024Updated last year
- Catalog of abusive language data (PLoS 2020)☆326Jun 14, 2024Updated last year
- ☆10Jul 27, 2018Updated 7 years ago
- Dutch abusive language data☆11Sep 23, 2023Updated 2 years ago
- ☆44Jun 29, 2023Updated 2 years ago
- Testing and training detection models for emoji-based hate speech.☆24May 15, 2022Updated 3 years ago
- Command-line tool for building Gephi force-directed graph diagrams.☆10Nov 10, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆20Feb 7, 2023Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆75Dec 9, 2022Updated 3 years ago
- Python standalone tokenizer☆15Nov 12, 2015Updated 10 years ago
- Generating global explanations from local ones☆11Nov 11, 2022Updated 3 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆91Mar 30, 2026Updated last month
- BERT Fine-tuning for Aspect Based Sentiment Analysis☆29Aug 2, 2022Updated 3 years ago
- A contextual approach for detecting hate speech code words☆10Jul 30, 2020Updated 5 years ago
- Datasets for Hate Speech Detection☆136May 12, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆238Jun 12, 2023Updated 2 years ago
- annotated hateful speech☆24Apr 6, 2019Updated 7 years ago
- Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"☆32May 31, 2021Updated 4 years ago
- Code for "Astraea: Grammar-based Fairness Testing"☆10Jan 7, 2022Updated 4 years ago
- A multilingual lexicon of words to hurt.☆96Oct 10, 2025Updated 6 months ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆25Feb 27, 2026Updated 2 months ago
- Tokenizer for Twitter and Reddit data☆45Apr 14, 2019Updated 7 years ago
- Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection☆38Dec 6, 2020Updated 5 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20May 14, 2022Updated 3 years ago
- SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment☆22Jan 18, 2023Updated 3 years ago
- Understanding attention for text classification☆16Nov 27, 2020Updated 5 years ago
- A demo of the vanishing gradient problem in a simple fully connected network classifying MNIST images.☆15Jan 16, 2018Updated 8 years ago
- ☆123May 2, 2024Updated 2 years ago
- python project template for personal projects! 🙋♀️☆11Nov 28, 2020Updated 5 years ago
- Boosting Synthetic Data Generation with Effective Nonlinear Causal Discovery☆17Sep 17, 2023Updated 2 years ago