leondz / hatespeechdata
Catalog of abusive language data (PLoS 2020)
☆309Updated 10 months ago
Alternatives and similar repositories for hatespeechdata:
Users that are interested in hatespeechdata are comparing it to the libraries listed below
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆171Updated 4 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆203Updated last year
- ☆233Updated 8 years ago
- A multilingual lexicon of words to hurt.☆89Updated 5 months ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated last year
- Pretrained BERT model for analysing COVID-19 Twitter data☆185Updated 2 years ago
- Repository for TweetEval☆372Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆88Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆300Updated last year
- Datasets for Hate Speech Detection☆126Updated last year
- This repository contains a dataset for hate speech detection on social media platforms.☆71Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆590Updated 9 months ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data☆58Updated 3 years ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆75Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆255Updated 7 months ago
- A dataset of millions of news articles scraped from a curated list of data sources.☆392Updated 5 years ago
- ☆166Updated 2 years ago
- Remove problematic gender bias from word embeddings.☆246Updated last year
- Cleans Reddit Text Data☆81Updated 5 years ago
- Datasets for fake news and misinformation detection☆66Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆157Updated 2 years ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.☆169Updated last week
- Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021☆37Updated 3 years ago
- Dataset for Emotion Recognition Research☆210Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆106Updated last year
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Updated 6 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆352Updated 2 years ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆341Updated 3 months ago
- ☆54Updated 3 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆177Updated 10 months ago