leondz / hatespeechdata
Catalog of abusive language data (PLoS 2020)
☆314 Updated last year
Alternatives and similar repositories for hatespeechdata
Users who are interested in hatespeechdata are comparing it to the repositories listed below
- Hate speech dataset from Stormfront forum manually labelled at sentence level. ☆174 Updated 5 years ago
- A multilingual lexicon of words to hurt. ☆89 Updated 2 weeks ago
- Repository for TweetEval ☆379 Updated 3 years ago
- ☆234 Updated 8 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ☆210 Updated 2 years ago
- Datasets for Hate Speech Detection ☆130 Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆109 Updated 2 years ago
- A reading list of up-to-date papers on NLP for Social Good. ☆304 Updated last year
- Pretrained BERT model for analysing COVID-19 Twitter data ☆184 Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) ☆594 Updated 11 months ago
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆89 Updated 2 years ago
- Dataset for Emotion Recognition Research ☆212 Updated 2 years ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text ☆234 Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity). ☆109 Updated last year
- This repository contains a dataset for hate speech detection on social media platforms. ☆73 Updated 2 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B… ☆29 Updated 6 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆156 Updated 2 years ago
- ☆54 Updated 3 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor… ☆179 Updated last year
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. ☆359 Updated 2 years ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset. ☆170 Updated 3 months ago
- Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenizati… ☆670 Updated last month
- This is a simple Python package for calculating a variety of lexical diversity indices ☆77 Updated last year
- This repository contains papers and resources pertaining to Hate speech research. ☆45 Updated 4 years ago
- Detect toxic spans in toxic texts ☆69 Updated 2 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data ☆59 Updated 3 years ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 ☆816 Updated 2 years ago
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data ☆157 Updated 2 years ago
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large c… ☆585 Updated last week
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl… ☆342 Updated last month