ewsheng / decoding-biases
Scripts to evaluate various bias metrics for different NLG models + decoding algorithms
☆16Updated last year
Alternatives and similar repositories for decoding-biases
Users who are interested in decoding-biases compare it to the repositories listed below
- Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021☆9Updated 3 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated last year
- ☆25Updated 3 years ago
- Dataset + classifier tools to study social perception biases in natural language generation☆69Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Updated 2 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆35Updated last year
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated 2 years ago
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14Updated 5 years ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)☆39Updated 2 years ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning☆13Updated 3 years ago
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…☆16Updated 4 years ago
- ☆11Updated last year
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Updated 2 years ago
- ☆24Updated 10 months ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆23Updated last year
- ☆31Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Updated 2 years ago
- Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…☆15Updated 3 years ago
- Code and data for paper "On the Robustness of Reading Comprehension Models to Entity Renaming" (NAACL'22)☆11Updated 2 years ago
- ☆13Updated last year
- ☆19Updated 3 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆13Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 2 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31Updated 5 years ago
- A question-answering dataset with a focus on subjective information☆45Updated last year
- ☆29Updated 3 years ago
- This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search …☆22Updated 4 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Updated 11 months ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆19Updated last year