intelligence-csd-auth-gr / Ethos-Hate-Speech-Dataset
This repository contains a dataset for hate speech detection on social media platforms.
☆66Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Ethos-Hate-Speech-Dataset
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆85Updated last year
- Datasets for Hate Speech Detection☆114Updated last year
- A multilingual lexicon of words to hurt.☆79Updated this week
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆188Updated last year
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆27Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆69Updated 3 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆56Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆107Updated last year
- ☆38Updated last year
- ☆53Updated 2 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆203Updated 3 years ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei…☆31Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- ☆67Updated 3 years ago
- HateEval 2019 - Task 5☆15Updated 5 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆28Updated 5 years ago
- Detect toxic spans in toxic texts☆66Updated last year
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆42Updated 3 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆199Updated 11 months ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆81Updated 3 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆76Updated 6 months ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch …☆79Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆92Updated last year
- Testing and training detection models for emoji-based hate speech.☆23Updated 2 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆129Updated 9 months ago
- ☆85Updated 2 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆92Updated 2 years ago
- Code for CAET5☆23Updated last year