ewsheng / decoding-biases
Scripts to evaluate various bias metrics for different NLG models + decoding algorithms
☆16 Updated last year
Alternatives and similar repositories for decoding-biases:
Users who are interested in decoding-biases are comparing it to the libraries listed below.
- Dataset + classifier tools to study social perception biases in natural language generation ☆66 Updated last year
- Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021 ☆9 Updated 3 years ago
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based… ☆15 Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)… ☆12 Updated 2 years ago
- ☆15 Updated 3 years ago
- Framework for controlling demographic biases in NLG (using adversarial prompts) ☆20 Updated last year
- ☆25 Updated 3 years ago
- ☆31 Updated last year
- ☆29 Updated 3 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/ ☆12 Updated 7 months ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei… ☆31 Updated 2 years ago
- ☆38 Updated last year
- ☆19 Updated 2 years ago
- Placeholder repository ☆14 Updated 2 years ago
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆20 Updated 2 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 Updated last year
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 Updated 4 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence ☆36 Updated last year
- ☆10 Updated 3 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆22 Updated last year
- Python package to deal with PAN corpora and extract stylometric features from text documents. ☆15 Updated 2 years ago
- ☆23 Updated 5 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 Updated 2 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so… ☆16 Updated last year
- Symmetric evaluation set based on the FEVER (fact verification) dataset ☆52 Updated 3 years ago
- ☆47 Updated 2 years ago
- Official Github repo for the paper "Evaluating the Evaluation of Diversity in Natural Language Generation" ☆19 Updated 3 years ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod… ☆33 Updated 11 months ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 Updated 2 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems" ☆19 Updated last year