A multilingual lexicon of words to hurt.
☆95Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for hurtlex
Users that are interested in hurtlex are comparing it to the libraries listed below
Sorting:
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"☆17May 23, 2022Updated 3 years ago
- Catalog of abusive language data (PLoS 2020)☆323Jun 14, 2024Updated last year
- ☆10Aug 31, 2022Updated 3 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆58Nov 26, 2024Updated last year
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆111Jun 12, 2023Updated 2 years ago
- The code of SKS☆15Mar 22, 2022Updated 3 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Mar 14, 2019Updated 6 years ago
- Toxicity Detection in Context: Assuming that the comment exists in a thread and that the parent comment or/and the discussion topic are e…☆29Jul 21, 2023Updated 2 years ago
- Datasets for Hate Speech Detection☆136May 12, 2023Updated 2 years ago
- ☆68Oct 28, 2021Updated 4 years ago
- Python Version of Andrew Welter's Hatebase Wrapper☆10Feb 20, 2022Updated 4 years ago
- A repository for resources relating to NLP in the Balochi language☆19Jun 3, 2023Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆94Jul 21, 2025Updated 7 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆234Jun 12, 2023Updated 2 years ago
- A generalized input-label embedding for text classification☆24Dec 6, 2019Updated 6 years ago
- Interpretable feature construction from taxonomies for text classification☆18Apr 4, 2022Updated 3 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- Deploying a Deep Learning model on Heroku Server☆10Dec 8, 2022Updated 3 years ago
- This project scrapes the entire public history of a Reddit user given their username☆14Dec 8, 2022Updated 3 years ago
- This repository contains a dataset for hate speech detection on social media platforms.☆74Dec 9, 2022Updated 3 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- This is for C2D2 Dataset: A Resource for Analyzing Cognitive Distortions and Its Impact on Mental Health☆33Nov 10, 2023Updated 2 years ago
- Harassment Lexicon and Corpus☆30May 22, 2018Updated 7 years ago
- Notes on papers in Natural Language Processing, Computational Linguistics, and the related sciences☆14Updated this week
- A corpus of comments tagged for multiple attributes of unhealthiness.☆37Mar 25, 2021Updated 4 years ago
- ☆16Dec 8, 2022Updated 3 years ago
- ☆14Dec 30, 2022Updated 3 years ago
- The Hateful Memes Challenge example code using MMF☆13Aug 25, 2020Updated 5 years ago
- The official repo for the Dialz Python library - a toolkit for steering vector research.☆22Jul 9, 2025Updated 8 months ago
- Implementation for EACL 2021 paper "Scientific Discourse Tagging for Evidence Extraction".☆20Sep 23, 2021Updated 4 years ago
- MetaCOVID: META-Coronavrius dataset repository☆37May 3, 2021Updated 4 years ago
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral …☆20Jun 13, 2025Updated 8 months ago
- Matlab (and C) implementation of Dependency-LDA, Prior-LDA and Flat-LDA models for multi-label document classification☆17Aug 2, 2016Updated 9 years ago
- Using GPT-3 to detect hate speech that contains sexist and racist content☆24Nov 11, 2025Updated 3 months ago
- A framework-agnostic datasets library for Machine Learning research and education.☆18Dec 8, 2022Updated 3 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Aug 20, 2021Updated 4 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆26Feb 16, 2026Updated 3 weeks ago