conversationai / unhealthy-conversations
A corpus of comments tagged for multiple attributes of unhealthiness.
☆34Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for unhealthy-conversations
- Toxicity Detection in Context: Assuming that the comment exists in a thread and that the parent comment or/and the discussion topic are e…☆27Updated last year
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆56Updated last year
- Dynamic ensemble decoding with transformer-based models☆29Updated last year
- This repository hosts the code for a tokenizer of tweets.☆12Updated 5 years ago
- This repository contains papers and resources pertaining to Hate speech research.☆43Updated 3 years ago
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"☆31Updated 4 years ago
- ☆40Updated 4 years ago
- ☆17Updated 6 years ago
- ☆22Updated 2 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆27Updated 2 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31Updated 4 years ago
- ☆54Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆86Updated last year
- Statistics on multilingual datasets☆17Updated 2 years ago
- ☆56Updated 3 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆76Updated 7 months ago
- How Contextual are Contextualized Word Representations?☆39Updated 4 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆102Updated 9 months ago
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop☆43Updated 2 years ago
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆15Updated 11 months ago
- ☆26Updated last month
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- A Dataset and Results for Classifying Emotions Across Languages☆10Updated 3 years ago
- annotated hateful speech☆25Updated 5 years ago
- Testing and training detection models for emoji-based hate speech.☆23Updated 2 years ago
- ☆20Updated last year
- On Generating Extended Summaries of Long Documents☆77Updated 3 years ago
- ☆85Updated 2 years ago
- Code for the paper "Measuring Bias in Contextualized Word Representations"☆35Updated 5 years ago