Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
☆59Oct 14, 2025Updated 5 months ago
Alternatives and similar repositories for hatecheck-data
Users that are interested in hatecheck-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆46May 26, 2025Updated 10 months ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"☆34Dec 13, 2021Updated 4 years ago
- Code for Blodgett et al. 2016, Demographic dialectal variation in social media☆25Nov 9, 2019Updated 6 years ago
- ☆234Dec 27, 2016Updated 9 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆20Aug 20, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆10Aug 31, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code☆11May 18, 2021Updated 4 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆111Jun 12, 2023Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆95Jul 21, 2025Updated 8 months ago
- ☆15Apr 10, 2018Updated 8 years ago
- Catalog of abusive language data (PLoS 2020)☆325Jun 14, 2024Updated last year
- Public repository for SemEval 2023 - Task 10 - Explainable Detection of Online Sexism (EDOS)☆25Apr 13, 2023Updated 2 years ago
- ☆44Jun 29, 2023Updated 2 years ago
- Testing and training detection models for emoji-based hate speech.☆24May 15, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Feb 7, 2023Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆75Dec 9, 2022Updated 3 years ago
- Python standalone tokenizer☆15Nov 12, 2015Updated 10 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Compare coverage across different media sources using the Juicer☆12Apr 1, 2016Updated 10 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- A contextual approach for detecting hate speech code words☆10Jul 30, 2020Updated 5 years ago
- Utilize BERT model for multi task including ABSA (aspect based sentiment analysis) task and AE (Aspect Extraction) task☆10May 31, 2019Updated 6 years ago
- Datasets for Hate Speech Detection☆136May 12, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code to reproduce experiments from the EMNLP 2015 paper about Rumour Stance Classification with Gaussian Processes.☆37May 23, 2016Updated 9 years ago
- annotated hateful speech☆24Apr 6, 2019Updated 7 years ago
- Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"☆32May 31, 2021Updated 4 years ago
- Testing ranking algorithms to improve social cohesion☆32Mar 26, 2025Updated last year
- Code for "Astraea: Grammar-based Fairness Testing"☆10Jan 7, 2022Updated 4 years ago
- A multilingual lexicon of words to hurt.☆95Oct 10, 2025Updated 6 months ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆25Feb 27, 2026Updated last month
- Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection☆38Dec 6, 2020Updated 5 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment☆22Jan 18, 2023Updated 3 years ago
- Understanding attention for text classification☆16Nov 27, 2020Updated 5 years ago
- Backtranslations of IMDB movie reviews for Data Augmentation Purposes☆10Apr 1, 2019Updated 7 years ago
- ☆121May 2, 2024Updated last year
- python project template for personal projects! 🙋♀️☆11Nov 28, 2020Updated 5 years ago
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Dec 8, 2022Updated 3 years ago
- Traduzindo e adaptando para a realidade brasileira o Bingo dos Dados Abertos - as desculpas mais comuns que você vai ouvir em projetos de…☆12Nov 29, 2019Updated 6 years ago