XuhuiZhou/Toxic_Debias

Code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Noah A. Smith, and Yejin Choi

☆20

Alternatives and similar repositories for Toxic_Debias

Users that are interested in Toxic_Debias are comparing it to the libraries listed below.

SALT-NLP / implicit-hate
☆46 · Jun 29, 2023 · Updated 3 years ago
swabhs / notebooks_for_aflite
IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".
☆16 · Aug 14, 2020 · Updated 5 years ago
uds-lsv / lexicon-of-abusive-words
This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…
☆29 · Mar 14, 2019 · Updated 7 years ago
anuradha1992 / HEAL
Code and the dataset for HEAL: A Knowledge Graph for Distress Management Conversations
☆23 · Nov 5, 2024 · Updated last year
BrendanKennedy / contextualizing-hate-speech-models-with-explanations
Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation"
☆35 · Dec 13, 2021 · Updated 4 years ago
abaheti95 / ToxiChat
Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…
☆17 · Jul 27, 2023 · Updated 2 years ago
dhfbk / twitter-abusive-context-dataset
☆10 · Aug 31, 2022 · Updated 3 years ago
Zheng321 / Deep-Reinforcement-Learning-for-Cost-Effective-Medical-Diagnosis
This repo contains the core code for the paper "Deep Reinforcement Learning for Cost-Effective Medical Diagnosis".
☆14 · Apr 7, 2023 · Updated 3 years ago
chrisc36 / debias
Methods of training NLP models to ignore biased strategies
☆55 · May 22, 2023 · Updated 3 years ago
RiTUAL-MBZUAI / Font_LDL_2020
This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"
☆12 · Nov 21, 2022 · Updated 3 years ago
judy2k / crochet-cad
A bunch of scripts to generate useful shapes in crochet.
☆19 · May 6, 2018 · Updated 8 years ago
bvidgen / Dynamically-Generated-Hate-Speech-Dataset
Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).
☆44 · May 26, 2025 · Updated last year
1783696285 / SKS
The code of SKS
☆15 · Mar 22, 2022 · Updated 4 years ago
hate-alert / HateXplain
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
☆248 · Jun 12, 2023 · Updated 3 years ago
OFAI / million-post-corpus
Annotated data set consisting of user comments posted to a German-language newspaper website
☆18 · Jun 28, 2018 · Updated 8 years ago
prakharguptaz / target-guided-dialogue-coda
Code for paper Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation
☆14 · Jun 10, 2022 · Updated 4 years ago
zkx06111 / ReDiffusion
☆17 · Jul 25, 2023 · Updated 2 years ago
utahnlp / x-fact
Official Code and Data repository of our ACL 2021 paper X-FACT: A New Benchmark Dataset for Multilingual Fact Checking.
☆28 · Oct 4, 2024 · Updated last year
tommasoc80 / AbuseEval
Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"
☆19 · Sep 23, 2023 · Updated 2 years ago
xiaoleihuang / Multilingual_Fairness_LREC
Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020.
☆19 · Dec 8, 2022 · Updated 3 years ago
lisaalaz / satbot
An empathetic counselling chatbot. Retrieval-based, uses finetuned LMs for emotion identification and to boost empathy, novelty and fluen…
☆18 · Jun 8, 2023 · Updated 3 years ago
MinhDucBui / Multi3Hate
☆15 · Jan 6, 2025 · Updated last year
richardsun-voyager / UAFTC
Understanding attention for text classification
☆16 · Nov 27, 2020 · Updated 5 years ago
GChrysostomou / ood_faith
☆13 · Jul 26, 2023 · Updated 2 years ago
dut-laowang / PCLMM
The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…
☆16 · Apr 3, 2025 · Updated last year
codertimo / python-template
python project template for personal projects! 🙋‍♀️
☆11 · Nov 28, 2020 · Updated 5 years ago
Abhishek0697 / Detection-of-Hate-Speech-in-Multimodal-Memes
Facebook Hatebook Memes Challenge
☆12 · Jan 28, 2021 · Updated 5 years ago
BinWang28 / FacEval
EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization
☆13 · Mar 20, 2025 · Updated last year
DCSaunders / gender-debias
Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…
☆13 · Mar 18, 2021 · Updated 5 years ago
sjtuprog / fox-news-comments
annotated hateful speech
☆24 · Apr 6, 2019 · Updated 7 years ago
jianglongye / featurenerf
FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models, ICCV 2023
☆13 · Jul 13, 2024 · Updated 2 years ago
Cohere-Labs-Community / goodtriever
Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"
☆25 · May 30, 2024 · Updated 2 years ago
videohatespeech / Implicit_Video_Hate
☆17 · Aug 4, 2025 · Updated 11 months ago
INK-USC / hierarchical-explanation-neural-sequence-models
Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.
☆29 · Jun 28, 2020 · Updated 6 years ago
HKUST-KnowComp / MLMA_hate_speech
Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"
☆58 · Nov 26, 2024 · Updated last year
SAP-archive / acl2022-self-contrastive-decorrelation
Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings".
☆26 · Mar 10, 2025 · Updated last year
xhan77 / veiled-toxicity-detection
Fortifying Toxic Speech Detectors Against Veiled Toxicity
☆11 · Oct 21, 2020 · Updated 5 years ago
facebookresearch / ResponsibleNLP
Repository for research in the field of Responsible NLP at Meta.
☆212 · Apr 18, 2026 · Updated 3 months ago
ozyyshr / ShareGPT_investigation
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions (EMNLP 2023)
☆13 · Dec 21, 2023 · Updated 2 years ago