Datasets for Hate Speech Detection
☆138May 12, 2023Updated 3 years ago
Alternatives and similar repositories for Datasets-for-Hate-Speech-Detection
Users who are interested in Datasets-for-Hate-Speech-Detection are comparing it to the libraries listed below.
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆245Jun 12, 2023Updated 3 years ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017☆844Jun 12, 2023Updated 3 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆112Jun 12, 2023Updated 3 years ago
- Catalog of abusive language data (PLoS 2020)☆324Jun 14, 2024Updated 2 years ago
- A multilingual lexicon of words to hurt.☆99Oct 10, 2025Updated 8 months ago
- Code for CAET5☆23Jun 12, 2023Updated 3 years ago
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"☆17May 23, 2022Updated 4 years ago
- The code of SKS☆15Mar 22, 2022Updated 4 years ago
- SemEval 2019 - Task 6 - Identifying and Categorizing Offensive Language in Social Media☆26Feb 26, 2019Updated 7 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆75Dec 9, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Experimental Code☆11May 18, 2021Updated 5 years ago
- Code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"☆16Aug 15, 2022Updated 3 years ago
- This is a Python project that is used to identify hate speech in tweets. The dataset used to train the model is available on Kaggle and c…☆39Apr 3, 2022Updated 4 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆95Jul 21, 2025Updated 10 months ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data☆59Oct 14, 2025Updated 8 months ago
- Dataset accompanying the paper "Investigating African-American Vernacular English in Transformer-Based Text Generation."☆10Apr 8, 2022Updated 4 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- ML model to detect hate speech and offensive language☆11May 25, 2021Updated 5 years ago
- Annotated hateful speech☆24Apr 6, 2019Updated 7 years ago
- Tools for querying various name-based gender inference services and evaluating them.☆10Dec 7, 2022Updated 3 years ago
- A collection of Korean Text Datasets ready to use using Tensorflow-Datasets.☆20Jun 8, 2022Updated 4 years ago
- BERT language model for hate speech detection.☆21Aug 6, 2020Updated 5 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Apr 8, 2025Updated last year
- Religious Hate Speech Detection for Arabic Tweets☆26Feb 1, 2019Updated 7 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated 4 months ago
- A Hierarchically-Labeled Portuguese Hate Speech Dataset☆35Jun 25, 2019Updated 6 years ago
- ☆13Apr 24, 2024Updated 2 years ago
- Testing and training detection models for emoji-based hate speech.☆24May 15, 2022Updated 4 years ago
- An NLP framework for finding hate speech comments in a comment corpus.☆11Dec 8, 2022Updated 3 years ago
- Detect toxic spans in toxic texts☆70Jun 12, 2023Updated 3 years ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆44May 26, 2025Updated last year
- Turkish and English Dataset from "Large-Scale Hate Speech Detection with Cross-Domain Transfer"☆31Oct 11, 2023Updated 2 years ago
- Hate Speech Detection Library for Python.☆195Oct 26, 2025Updated 7 months ago
- A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media☆43Jan 28, 2022Updated 4 years ago
- Provides functions for converting 모두의 말뭉치 (Modu Corpus) data into a format convenient for analysis.☆11Mar 2, 2022Updated 4 years ago
- Developing a deployment-ready classification model to detect hate tweets using various NLP techniques☆19Oct 7, 2024Updated last year
- Repo for MCMC based Dynamic Topic Model☆16Sep 2, 2017Updated 8 years ago