Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data
☆60Oct 14, 2025Updated 4 months ago
Alternatives and similar repositories for hatecheck-data
Users that are interested in hatecheck-data are comparing it to the libraries listed below
Sorting:
- ☆10Jul 27, 2018Updated 7 years ago
- ☆234Dec 27, 2016Updated 9 years ago
- ☆15Apr 10, 2018Updated 7 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆74Dec 9, 2022Updated 3 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆94Jul 21, 2025Updated 7 months ago
- ☆44Jun 29, 2023Updated 2 years ago
- Testing and training detection models for emoji-based hate speech.☆24May 15, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code☆11May 18, 2021Updated 4 years ago
- Compare coverage across different media sources using the Juicer☆12Apr 1, 2016Updated 9 years ago
- ☆10Aug 31, 2022Updated 3 years ago
- annotated hateful speech☆24Apr 6, 2019Updated 6 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆111Jun 12, 2023Updated 2 years ago
- autoredteam: code for training models that automatically red team other language models☆15Aug 9, 2023Updated 2 years ago
- Catalog of abusive language data (PLoS 2020)☆323Jun 14, 2024Updated last year
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.☆345Jun 17, 2024Updated last year
- Python standalone tokenizer☆15Nov 12, 2015Updated 10 years ago
- Netflix for XBMC☆61Nov 13, 2012Updated 13 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20May 14, 2022Updated 3 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆129Mar 1, 2024Updated 2 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆26Feb 16, 2026Updated 2 weeks ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Aug 28, 2020Updated 5 years ago
- Datasets for Hate Speech Detection☆136May 12, 2023Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆91Feb 12, 2026Updated 2 weeks ago
- Ongoing research training transformer models at scale☆43Updated this week
- ☆55Mar 24, 2022Updated 3 years ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆25Mar 4, 2025Updated 11 months ago
- The code for the paper "Adversarial Decomposition of Text Representation", NAACL 2019☆29Dec 8, 2022Updated 3 years ago
- Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"☆32May 31, 2021Updated 4 years ago
- ☆119May 2, 2024Updated last year
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"☆32Jun 20, 2023Updated 2 years ago
- OSoMe Twitter tools. Including a package like tweepy but for the v2 Twitter api.☆31Jan 6, 2023Updated 3 years ago
- BERT Fine-tuning for Aspect Based Sentiment Analysis☆29Aug 2, 2022Updated 3 years ago
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"☆32Nov 11, 2020Updated 5 years ago
- Twitter conversation collection script, which collects all replies to a given tweet☆68Jan 21, 2016Updated 10 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Jun 21, 2022Updated 3 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆86Mar 2, 2021Updated 5 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- OSoMe API mashups☆11Jan 29, 2019Updated 7 years ago