ewsheng / decoding-biasesLinks

Scripts to evaluate various bias metrics for different NLG models + decoding algorithms

☆16

Alternatives and similar repositories for decoding-biases

Users that are interested in decoding-biases are comparing it to the libraries listed below

Sorting:

iitmnlp / EvalEval
Perturbation CheckLists for Evaluating NLG Evaluation Metrics, EMNLP 2021
☆9Updated 3 years ago
phosseini / GisPy
GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/
☆12Updated last year
kanekomasahiro / context-debias
☆25Updated 3 years ago
ewsheng / nlg-bias
Dataset + classifier tools to study social perception biases in natural language generation
☆69Updated 2 years ago
sabithsn / APPDIA-Discourse-Style-Transfer
Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…
☆13Updated 2 years ago
AIPHES / DiscoScore
DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence
☆35Updated last year
allenai / few_shot_explanations
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆31Updated 2 years ago
google-research-datasets / lareqa
LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…
☆14Updated 5 years ago
meetdavidwan / factpegasus
PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022)
☆39Updated 2 years ago
nlp-waseda / mtl-eadrg
Emotion-Aware Dialogue Response Generation by Multi-Task Learning
☆13Updated 3 years ago
atcbosselut / scs-baselines
Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…
☆16Updated 4 years ago
eval4nlp / SharedTask2023
☆11Updated last year
EagleW / Stage-wise-Fine-tuning
Code for Stage-wise Fine-tuning for Graph-to-Text Generation
☆26Updated 2 years ago
Yifan-Gao / open_retrieval_conversational_machine_reading
Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset
☆13Updated 2 years ago
allenai / dream
☆24Updated 10 months ago
davidheineman / thresh
🌾 Universal, customizable and deployable fine-grained evaluation for text generation.
☆23Updated last year
allenai / tailor
☆31Updated last year
benpry / chain-of-thought-metaphor
This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…
☆14Updated 2 years ago
feyzaakyurek / bbnli
Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…
☆15Updated 3 years ago
INK-USC / entity-robustness
Code and data for paper "On the Robustness of Reading Comprehension Models to Entity Renaming" (NAACL'22)
☆11Updated 2 years ago
GChrysostomou / ood_faith
☆13Updated last year
allenai / ACCoRD
☆19Updated 3 years ago
thunlp / CSS-LM
CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models
☆13Updated 2 years ago
sunlab-osu / ReasonBERT
Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021
☆29Updated 2 years ago
timoschick / form-context-model
This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.
☆31Updated 5 years ago
megagonlabs / SubjQA
A question-answering dataset with a focus on subjective information
☆45Updated last year
jungokasai / twist_decoding
☆29Updated 3 years ago
PengjieRen / CaSE_WISE
This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search …
☆22Updated 4 years ago
martiansideofthemoon / longeval-summarization
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…
☆44Updated 11 months ago
SALT-NLP / mic
Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"
☆19Updated last year