McGill-NLP/bias-bench

ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.

☆156

Alternatives and similar repositories for bias-bench

Users who are interested in bias-bench are comparing it to the libraries listed below.

princeton-nlp / MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975
☆38 · Dec 14, 2023 · Updated 2 years ago
moinnadeem / StereoSet
StereoSet: Measuring stereotypical bias in pretrained language models
☆201 · Dec 8, 2022 · Updated 3 years ago
EmpathYang / ADEPT
Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).
☆15 · Dec 13, 2024 · Updated last year
amazon-science / bold
Dataset associated with the paper "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation"
☆88 · Mar 2, 2021 · Updated 5 years ago
Irenehere / Auto-Debias
☆33 · May 5, 2022 · Updated 4 years ago
nyu-mll / crows-pairs
This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…
☆135 · Mar 1, 2024 · Updated 2 years ago
pliang279 / sent_debias
[ACL 2020] Towards Debiasing Sentence Representations
☆64 · Nov 21, 2022 · Updated 3 years ago
i-gallegos / Fair-LLM-Benchmark
☆160 · Sep 12, 2023 · Updated 2 years ago
pliang279 / LM_bias
[ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models
☆61 · Nov 2, 2022 · Updated 3 years ago
uclanlp / awesome-fairness-papers
Papers on fairness in NLP
☆452 · May 2, 2024 · Updated 2 years ago
facebookresearch / ResponsibleNLP
Repository for research in the field of Responsible NLP at Meta.
☆209 · Apr 18, 2026 · Updated last month
shauli-ravfogel / nullspace_projection
☆94 · Jun 6, 2022 · Updated 3 years ago
timoschick / self-debiasing
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
☆89 · Aug 20, 2021 · Updated 4 years ago
ewsheng / controllable-nlg-biases
Framework for controlling demographic biases in NLG (using adversarial prompts)
☆21 · Jun 12, 2023 · Updated 2 years ago
nyu-mll / BBQ
Repository for the Bias Benchmark for QA dataset.
☆142 · Jan 8, 2024 · Updated 2 years ago
ewsheng / decoding-biases
Scripts to evaluate various bias metrics for different NLG models + decoding algorithms
☆16 · Dec 6, 2023 · Updated 2 years ago
kanekomasahiro / evaluate_bias_in_mlm
☆13 · Dec 1, 2021 · Updated 4 years ago
MSR-LIT / MultilingualBias
☆10 · Jul 6, 2023 · Updated 2 years ago
MilaNLProc / honest
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
☆21 · Apr 8, 2025 · Updated last year
uclanlp / gn_glove
Learning Gender-Neutral Word Embeddings
☆47 · Oct 3, 2019 · Updated 6 years ago
McGill-NLP / instruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆87 · Aug 12, 2024 · Updated last year
ewsheng / nlg-bias
Dataset + classifier tools to study social perception biases in natural language generation
☆72 · Jun 12, 2023 · Updated 2 years ago
zhliu0106 / learning-to-refuse
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10 · Dec 13, 2024 · Updated last year
g8a9 / ear
Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"
☆50 · May 31, 2022 · Updated 3 years ago
kanekomasahiro / context-debias
☆25 · Feb 6, 2022 · Updated 4 years ago
kanekomasahiro / bias_eval_in_multiple_mlm
☆11 · Jul 7, 2023 · Updated 2 years ago
HrishikeshVish / Fairpy
☆25 · Aug 2, 2024 · Updated last year
hljoren / compare-embedding-bias
Uses the WEAT statistic to compare bias among word embeddings trained with different algorithms, from different sources, or after debiasing
☆13 · May 28, 2019 · Updated 7 years ago
rudinger / winogender-schemas
Data for evaluating gender bias in coreference resolution systems.
☆81 · May 14, 2019 · Updated 7 years ago
amazon-science / generalized-fairness-metrics
☆14 · Feb 4, 2026 · Updated 3 months ago
EnnengYang / Efficient-WEMoE
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. arXiv, 2024.
☆16 · Oct 28, 2024 · Updated last year
HelloEveryboby / Butler
Butler is a tool project for automated service management and task scheduling.
☆16 · May 16, 2026 · Updated last week
chadaeun / weat_replication
Replication of the Word Embedding Association Test (WEAT), which is suggested in Semantics derived automatically from language corpora necess…
☆34 · Aug 2, 2018 · Updated 7 years ago
Lslland / T-Vaccine
☆19 · Jun 21, 2025 · Updated 11 months ago
xiaoleihuang / Multilingual_Fairness_LREC
Data and code repository for "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020.
☆19 · Dec 8, 2022 · Updated 3 years ago
PrithivirajDamodaran / vision-language-modelling-series
Companion repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations
☆14 · Aug 16, 2022 · Updated 3 years ago
shaoxia57 / Bias_in_Gendered_Languages
This is a repo for the EMNLP 2019 paper on gender bias in gendered languages.
☆23 · Sep 6, 2019 · Updated 6 years ago
carolinlawrence / gradient-rollback
Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…
☆21 · Mar 16, 2021 · Updated 5 years ago
sail-sg / closer-look-LLM-unlearning
[ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models
☆49 · Dec 4, 2024 · Updated last year