Code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Noah A. Smith, and Yejin Choi
☆19 Aug 20, 2021 Updated 4 years ago
Alternatives and similar repositories for Toxic_Debias
Users who are interested in Toxic_Debias are comparing it to the repositories listed below
- ☆44 Jun 29, 2023 Updated 2 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection" ☆12 Nov 21, 2022 Updated 3 years ago
- Official code release for ACL 2020 paper "Contextualizing Hate Speech Classifiers with Post hoc Explanation" ☆35 Dec 13, 2021 Updated 4 years ago
- The code of SKS ☆15 Mar 22, 2022 Updated 3 years ago
- ☆17 Jul 25, 2023 Updated 2 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases". ☆16 Aug 14, 2020 Updated 5 years ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!" ☆18 Sep 23, 2023 Updated 2 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so… ☆17 Jul 27, 2023 Updated 2 years ago
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆19 Dec 8, 2022 Updated 3 years ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021). ☆46 May 26, 2025 Updated 9 months ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question. ☆234 Jun 12, 2023 Updated 2 years ago
- Testing and training detection models for emoji-based hate speech. ☆24 May 15, 2022 Updated 3 years ago
- annotated hateful speech ☆24 Apr 6, 2019 Updated 6 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis" ☆58 Nov 26, 2024 Updated last year
- Methods of training NLP models to ignore biased strategies ☆55 May 22, 2023 Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets. ☆111 Jun 12, 2023 Updated 2 years ago
- Source code for ACL 2022 paper "Self-contrastive Decorrelation for Sentence Embeddings". ☆26 Mar 10, 2025 Updated 11 months ago
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020. ☆30 Jun 28, 2020 Updated 5 years ago
- Toxicity Detection in Context: Assuming that the comment exists in a thread and that the parent comment or/and the discussion topic are e… ☆29 Jul 21, 2023 Updated 2 years ago
- Create awesome games with GPT ☆32 Mar 21, 2023 Updated 2 years ago
- Official Code and Data repository of our ACL 2021 paper X-FACT: A New Benchmark Dataset for Multilingual Fact Checking. ☆27 Oct 4, 2024 Updated last year
- ☆32 Apr 24, 2024 Updated last year
- Introduction to Algorithms, Third Edition. ☆10 Apr 2, 2017 Updated 8 years ago
- ☆12 Dec 14, 2022 Updated 3 years ago
- ☆11 Jun 18, 2023 Updated 2 years ago
- Collection of talks given in the ML reading group@IIITD ☆11 Mar 31, 2021 Updated 4 years ago
- Fortifying Toxic Speech Detectors Against Veiled Toxicity ☆11 Oct 21, 2020 Updated 5 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati… ☆10 Sep 23, 2023 Updated 2 years ago
- ☆11 May 24, 2024 Updated last year
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection" ☆10 May 5, 2020 Updated 5 years ago
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding ☆11 May 23, 2024 Updated last year
- This repo contains the dataset and description for Ruddit and its variants. ☆36 Feb 13, 2022 Updated 4 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts" ☆11 Jul 18, 2021 Updated 4 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/ ☆13 Jul 1, 2024 Updated last year
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes… ☆10 May 9, 2024 Updated last year
- A course in numerical methods with Python for engineers and scientists: currently 5 learning modules, with student assignments. ☆10 Dec 6, 2017 Updated 8 years ago
- Word embeddings trained on medical subreddits. ☆10 Jan 4, 2021 Updated 5 years ago
- Full List of Bad Words and Top Swear Words Banned by Google. As they closed the api ☆12 Sep 26, 2018 Updated 7 years ago
- This project uses gpt-4 to build agents to play one night werewolf. ☆10 Jul 14, 2023 Updated 2 years ago