This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-Based Approach" by Michael Wiegand, Josef Ruppenhofer, Anna Schmidt and Clayton Greenberg.
☆29 · Mar 14, 2019 · Updated 7 years ago
Alternatives and similar repositories for lexicon-of-abusive-words
Users that are interested in lexicon-of-abusive-words are comparing it to the libraries listed below.
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd… ☆20 · Aug 20, 2021 · Updated 4 years ago
- Framework for training dependency parsing models. ☆12 · Jun 12, 2024 · Updated last year
- Variationist: Exploring Multifaceted Variation and Bias in Written Language Data (ACL 2024 demo track) ☆10 · Jan 31, 2026 · Updated 3 months ago
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM ☆17 · Feb 14, 2019 · Updated 7 years ago
- Code for "Dynamic Contextualized Word Embeddings" ☆33 · Dec 30, 2021 · Updated 4 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆40 · Nov 13, 2025 · Updated 5 months ago
- annotated hateful speech ☆24 · Apr 6, 2019 · Updated 7 years ago
- This repo contains the dataset and description for Ruddit and its variants. ☆36 · Feb 13, 2022 · Updated 4 years ago
- A multilingual lexicon of words to hurt. ☆98 · Oct 10, 2025 · Updated 7 months ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999 ☆10 · Dec 15, 2016 · Updated 9 years ago
- The code of SKS ☆15 · Mar 22, 2022 · Updated 4 years ago
- SemEval 2019 - Task 6 - Identifying and Categorizing Offensive Language in Social Media ☆26 · Feb 26, 2019 · Updated 7 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation. ☆12 · Nov 17, 2023 · Updated 2 years ago
- Assessing syntactic abilities of BERT ☆149 · May 23, 2019 · Updated 6 years ago
- Hate speech dataset from Stormfront forum manually labelled at sentence level. ☆175 · May 22, 2020 · Updated 5 years ago
- POSIX: A Prompt Sensitivity Index for Language Models ☆13 · Nov 13, 2024 · Updated last year
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ☆15 · Aug 27, 2024 · Updated last year
- This repository contains papers and resources pertaining to Hate speech research. ☆44 · May 30, 2021 · Updated 4 years ago
- Harassment Lexicon and Corpus ☆30 · May 22, 2018 · Updated 7 years ago
- A Dataset and Results for Classifying Emotions Across Languages ☆10 · Jun 20, 2021 · Updated 4 years ago
- Download and load spaCy models on-the-fly ☆15 · Feb 9, 2023 · Updated 3 years ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!" ☆19 · Sep 23, 2023 · Updated 2 years ago
- Data from the Sequoia treebank. ☆11 · Updated this week
- 2018 Duolingo Shared Task on Second Language Acquisition Modeling (SLAM) (http://sharedtask.duolingo.com/) ☆12 · May 31, 2018 · Updated 7 years ago
- Annotated data set consisting of user comments posted to a German-language newspaper website ☆17 · Jun 28, 2018 · Updated 7 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆16 · Updated this week
- An automatically annotated sentiment analysis dataset of product reviews in Russian. ☆17 · Oct 25, 2020 · Updated 5 years ago
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis ☆16 · Jan 13, 2022 · Updated 4 years ago
- Benchmark Datasets for BioNLP Tasks ☆17 · May 7, 2025 · Updated last year
- This package supports implementation of anchor-based topic modeling and variants of the anchoring algorithm in Python 3. ☆15 · Sep 17, 2018 · Updated 7 years ago
- Testing and training detection models for emoji-based hate speech. ☆24 · May 15, 2022 · Updated 3 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe… ☆15 · Oct 18, 2022 · Updated 3 years ago
- Unsupervised spoken sentence embeddings ☆14 · Dec 14, 2022 · Updated 3 years ago