nyu-mll / BBQ
Repository for the Bias Benchmark for QA dataset.
☆98 Updated last year
Alternatives and similar repositories for BBQ:
Users who are interested in BBQ are comparing it to the repositories listed below.
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆70 Updated 3 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske… ☆109 Updated 10 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. ☆130 Updated last month
- ☆119 Updated last year
- ☆25 Updated 11 months ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings) ☆20 Updated 3 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆57 Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated last year
- Codebase, data and models for the SummaC paper in TACL ☆87 Updated 3 weeks ago
- Text generation using language models with multiple exit heads ☆15 Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆66 Updated 2 years ago
- ☆44 Updated last year
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua… ☆31 Updated last year
- Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" ☆79 Updated last year
- ☆24 Updated 4 months ago
- ☆46 Updated last year
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge ☆12 Updated 11 months ago
- ☆70 Updated last year
- Data for evaluating gender bias in coreference resolution systems. ☆72 Updated 5 years ago
- ☆57 Updated last month
- ☆51 Updated 2 months ago
- ☆81 Updated last year
- ☆75 Updated last year
- Framework for controlling demographic biases in NLG (using adversarial prompts) ☆20 Updated last year
- ☆30 Updated 8 months ago
- ☆42 Updated last year
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he… ☆22 Updated 2 months ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP". ☆87 Updated 3 years ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019. ☆54 Updated 3 years ago
- ☆100 Updated 8 months ago