alatteaday / mrp_hate-speech-detectionLinks
☆15Updated 2 years ago
Alternatives and similar repositories for mrp_hate-speech-detection
Users that are interested in mrp_hate-speech-detection are comparing it to the libraries listed below
Sorting:
- Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT☆205Updated 5 years ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆42Updated last year
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆128Updated last year
- ☆39Updated 3 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆153Updated 5 months ago
- ☆73Updated 5 years ago
- BARTScore: Evaluating Generated Text as Text Generation☆366Updated 3 years ago
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆167Updated 4 years ago
- ☆16Updated 3 years ago
- templates and other documents regarding responsible NLP research☆70Updated 2 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆85Updated 4 years ago
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023☆27Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL☆108Updated last year
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆35Updated 4 years ago
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark☆24Updated 3 years ago
- in this project, I've implemented the Facebook paper about fine tuning RoBERTa with contrastive loss.☆58Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆56Updated 4 years ago
- ☆17Updated 3 years ago
- Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.☆231Updated 2 years ago
- This repo contains the dataset for the EMNLP 2022 paper "Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts…☆19Updated 2 years ago
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detection☆16Updated 8 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated 2 years ago
- Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"☆32Updated 4 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Updated 4 years ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆33Updated 9 months ago
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆44Updated 4 years ago
- [ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners☆130Updated 3 years ago
- ☆158Updated 2 years ago
- ☆88Updated 2 years ago
- ☆13Updated 2 years ago