nyu-mll/BBQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nyu-mll/BBQ)

nyu-mll / BBQ

Repository for the Bias Benchmark for QA dataset.

☆146

Alternatives and similar repositories for BBQ

Users that are interested in BBQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-science / bold
View on GitHub
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
☆88Mar 2, 2021Updated 5 years ago
i-gallegos / Fair-LLM-Benchmark
View on GitHub
☆164Sep 12, 2023Updated 2 years ago
McGill-NLP / bias-bench
View on GitHub
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
☆156Aug 18, 2025Updated 11 months ago
moinnadeem / StereoSet
View on GitHub
StereoSet: Measuring stereotypical bias in pretrained language models
☆204Dec 8, 2022Updated 3 years ago
zhliu0106 / learning-to-refuse
View on GitHub
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10Dec 13, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
W4ngatang / sent-bias
View on GitHub
Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.
☆57May 23, 2021Updated 5 years ago
facebookresearch / ResponsibleNLP
View on GitHub
Repository for research in the field of Responsible NLP at Meta.
☆212Apr 18, 2026Updated 3 months ago
frankaging / Causal-Distill
View on GitHub
The Codebase for Causal Distillation for Language Models (NAACL '22)
☆26May 1, 2022Updated 4 years ago
boyiwei / CoTaEval
View on GitHub
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
☆17Jul 17, 2024Updated 2 years ago
jaechan-repo / muse_bench
View on GitHub
☆33Aug 9, 2024Updated last year
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
HelloEveryboby / Butler
View on GitHub
Butler 是一个用于自动化服务管理和任务调度的工具项目。
☆17Updated this week
allenai / real-toxicity-prompts
View on GitHub
☆234Feb 23, 2021Updated 5 years ago
Lslland / T-Vaccine
View on GitHub
☆19Jun 21, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OPTML-Group / SOUL
View on GitHub
Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"
☆30Oct 1, 2024Updated last year
ethz-spylab / unlearning-vs-safety
View on GitHub
☆27Oct 6, 2024Updated last year
arobey1 / advbench
View on GitHub
☆45Mar 3, 2023Updated 3 years ago
timoschick / self-debiasing
View on GitHub
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
☆89Aug 20, 2021Updated 4 years ago
acl-org / ethics-reading-list
View on GitHub
A list of ethics related resources for researchers and practitioners of Natural Language Processing and Computational Linguistics
☆34Oct 20, 2025Updated 9 months ago
rudinger / winogender-schemas
View on GitHub
Data for evaluating gender bias in coreference resolution systems.
☆83May 14, 2019Updated 7 years ago
OPTML-Group / WAGLE
View on GitHub
Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"
☆19Dec 16, 2024Updated last year
vinid / safety-tuned-llamas
View on GitHub
ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.
☆95May 9, 2024Updated 2 years ago
ewsheng / nlg-bias
View on GitHub
Dataset + classifier tools to study social perception biases in natural language generation
☆72Jun 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
uclanlp / awesome-fairness-papers
View on GitHub
Papers on fairness in NLP
☆452May 2, 2024Updated 2 years ago
feyzaakyurek / bbnli
View on GitHub
Bias Benchmark for Natural Language Inference. Code repo for the Findings of NAACL 2022 paper "On Measuring Social Biases in Prompt-Based…
☆15Apr 28, 2022Updated 4 years ago
houseme / sensitive-rs
View on GitHub
Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to han…
☆26Jul 22, 2026Updated last week
MSR-LIT / MultilingualBias
View on GitHub
☆10Jul 6, 2023Updated 3 years ago
NJUPT-SAST / aurora-ui
View on GitHub
🌏 UI component library for the future, based on WebComponent.
☆23Nov 12, 2024Updated last year
theNamek / Bias-Neurons
View on GitHub
☆11Apr 28, 2024Updated 2 years ago
FanZT6 / FairMT-bench
View on GitHub
☆14Mar 7, 2025Updated last year
ruiqi-zhong / nlparam
View on GitHub
Augmenting Statistical Models with Natural Language Parameters
☆28Sep 17, 2024Updated last year
Scarelette / CultureLLM
View on GitHub
☆42Oct 29, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Babelscape / ALERT
View on GitHub
Official repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
☆60Sep 20, 2024Updated last year
microsoft / HiTab
View on GitHub
[ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.
☆109Dec 16, 2025Updated 7 months ago
TIGER-AI-Lab / MAmmoTH2
View on GitHub
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146Oct 27, 2024Updated last year
MilaNLProc / honest
View on GitHub
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
☆21Apr 8, 2025Updated last year
git-disl / Vaccine
View on GitHub
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆51Jan 15, 2026Updated 6 months ago
weiguowilliam / CEAT
View on GitHub
☆21May 1, 2021Updated 5 years ago
myracheng / markedpersonas
View on GitHub
Code and data for Marked Personas (ACL 2023)
☆30May 26, 2023Updated 3 years ago