alatteaday / mrp_hate-speech-detection
☆15Updated last year
Alternatives and similar repositories for mrp_hate-speech-detection:
Users that are interested in mrp_hate-speech-detection are comparing it to the libraries listed below
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023☆23Updated last year
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆199Updated last year
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Updated last year
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆115Updated last year
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆22Updated 11 months ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆28Updated 4 months ago
- Natural Universal Trigger Search (NUTS)☆21Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆77Updated 4 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆135Updated 3 months ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆30Updated 5 months ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆57Updated 2 years ago
- ☆67Updated 4 years ago
- ☆128Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆80Updated 6 months ago
- ☆23Updated 8 months ago
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark☆25Updated 2 years ago
- ☆25Updated 2 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Updated 3 years ago
- ☆30Updated 2 years ago
- ☆39Updated 3 years ago
- ☆11Updated 3 years ago
- Official Code for EMNLP 2023 paper: "Unveiling the Implicit Toxicity in Large Language Models""☆11Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- templates and other documents regarding responsible NLP research☆67Updated last year
- Repository for the Bias Benchmark for QA dataset.☆106Updated last year
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆22Updated last year
- Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples (EACL 2021)☆8Updated 3 years ago
- This repo contains the dataset for the EMNLP 2022 paper "Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts…☆19Updated last year
- The code of SKS☆15Updated 3 years ago