code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Noah A. Smith and Yejin Choi
☆20Aug 20, 2021Updated 4 years ago
Alternatives and similar repositories for Toxic_Debias
Users that are interested in Toxic_Debias are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago
- ☆44Jun 29, 2023Updated 2 years ago
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Mar 14, 2019Updated 7 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- Methods of training NLP models to ignored biased strategies☆55May 22, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- Röttger et al. (ACL 2021): "HateCheck: Functional Tests for Hate Speech Detection Models" - Data☆59Oct 14, 2025Updated 6 months ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆46May 26, 2025Updated 10 months ago
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"☆17May 23, 2022Updated 3 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆238Jun 12, 2023Updated 2 years ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Sep 23, 2023Updated 2 years ago
- ☆17Jul 25, 2023Updated 2 years ago
- Understanding attention for text classification☆16Nov 27, 2020Updated 5 years ago
- ☆14Jan 6, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Jul 26, 2023Updated 2 years ago
- Official Code and Data repository of our ACL 2021 paper X-FACT: A New Benchmark Dataset for Multilingual Fact Checking.☆27Oct 4, 2024Updated last year
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆25May 30, 2024Updated last year
- python project template for personal projects! 🙋♀️☆11Nov 28, 2020Updated 5 years ago
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆16Apr 3, 2025Updated last year
- FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models, ICCV 2023☆13Jul 13, 2024Updated last year
- Testing and training detection models for emoji-based hate speech.☆24May 15, 2022Updated 3 years ago
- An empathetic counselling chatbot. Retrieval-based, uses finetuned LMs for emotion identification and to boost empathy, novelty and fluen…