Vicomtech / hate-speech-datasetLinks
Hate speech dataset from Stormfront forum manually labelled at sentence level.
☆175Updated 5 years ago
Alternatives and similar repositories for hate-speech-dataset
Users that are interested in hate-speech-dataset are comparing it to the libraries listed below
Sorting:
- Catalog of abusive language data (PLoS 2020)☆321Updated last year
- ☆234Updated 9 years ago
- A multilingual lexicon of words to hurt.☆92Updated 2 months ago
- Pretrained BERT model for analysing COVID-19 Twitter data☆184Updated 2 years ago
- Datasets for Hate Speech Detection☆135Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆110Updated 2 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆228Updated 2 years ago
- ☆55Updated 3 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆94Updated 5 months ago
- This repository contains papers and resources pertaining to Hate speech research.☆44Updated 4 years ago
- Datasets for fake news and misinformation detection☆70Updated 2 years ago
- Testing and training detection models for emoji-based hate speech.☆24Updated 3 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Updated 6 years ago
- Repository for TweetEval☆390Updated 3 years ago
- Harassment Lexicon and Corpus☆30Updated 7 years ago
- Intersectional bias in hate speech and abusive language datasets☆15Updated last year
- Detect toxic spans in toxic texts☆71Updated 2 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆30Updated 4 years ago
- Hate Speech Detection Library for Python.☆194Updated 2 months ago
- ☆15Updated 7 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆110Updated 2 years ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- Code for Blodgett et al. 2016, Demographic dialectal variation in social media☆25Updated 6 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆74Updated 3 years ago
- ☆68Updated 4 years ago
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆70Updated 3 years ago
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop☆43Updated 3 years ago
- ☆171Updated 2 years ago
- Repository for the LREC 2022 submission on Emotion Word Dynamics in Geolocated Tweet data.☆104Updated 2 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆183Updated last month