amazon-science/bold

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amazon-science/bold)

amazon-science / bold

Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper

☆88

Alternatives and similar repositories for bold

Users that are interested in bold are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

McGill-NLP / bias-bench
View on GitHub
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
☆156Aug 18, 2025Updated 11 months ago
allenai / unqover
View on GitHub
UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)
☆20Jul 6, 2021Updated 5 years ago
nyu-mll / BBQ
View on GitHub
Repository for the Bias Benchmark for QA dataset.
☆146Jan 8, 2024Updated 2 years ago
nyu-mll / crows-pairs
View on GitHub
This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…
☆137Mar 1, 2024Updated 2 years ago
moinnadeem / StereoSet
View on GitHub
StereoSet: Measuring stereotypical bias in pretrained language models
☆204Dec 8, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
uclanlp / awesome-fairness-papers
View on GitHub
Papers on fairness in NLP
☆452May 2, 2024Updated 2 years ago
zhliu0106 / learning-to-refuse
View on GitHub
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
☆10Dec 13, 2024Updated last year
ewsheng / controllable-nlg-biases
View on GitHub
Framework for controlling demographic biases in NLG (using adversarial prompts)
☆21Jun 12, 2023Updated 3 years ago
lucy3 / gpt3_gender
View on GitHub
Narrative Understanding Workshop paper (2021) on gender in GPT-3 generated stories
☆14May 28, 2021Updated 5 years ago
timoschick / self-debiasing
View on GitHub
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
☆89Aug 20, 2021Updated 4 years ago
boyiwei / CoTaEval
View on GitHub
[NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models
☆17Jul 17, 2024Updated 2 years ago
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
Lslland / T-Vaccine
View on GitHub
☆19Jun 21, 2025Updated last year
OPTML-Group / WAGLE
View on GitHub
Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"
☆19Dec 16, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
houseme / sensitive-rs
View on GitHub
Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to han…
☆26Updated this week
allenai / real-toxicity-prompts
View on GitHub
☆233Feb 23, 2021Updated 5 years ago
Thartvigsen / GRACE
View on GitHub
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆86Dec 21, 2024Updated last year
tailequy / fairness_dataset
View on GitHub
Datasets for fairness-aware machine learning
☆13Mar 4, 2025Updated last year
AI21Labs / factor
View on GitHub
Code and data for the FACTOR paper
☆54Nov 15, 2023Updated 2 years ago
zjunlp / BiasEdit
View on GitHub
[TrustNLP@NAACL 2025] BiasEdit: Debiasing Stereotyped Language Models via Model Editing
☆18Sep 30, 2025Updated 9 months ago
vinid / safety-tuned-llamas
View on GitHub
ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.
☆95May 9, 2024Updated 2 years ago
facebookresearch / ResponsibleNLP
View on GitHub
Repository for research in the field of Responsible NLP at Meta.
☆212Apr 18, 2026Updated 3 months ago
uclanlp / gn_glove
View on GitHub
Learning Gender-Neutral Word Embeddings
☆47Oct 3, 2019Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
SongW-SW / CEB
View on GitHub
☆15Jun 25, 2025Updated last year
hendrycks / ethics
View on GitHub
Aligning AI With Shared Human Values (ICLR 2021)
☆325Apr 21, 2023Updated 3 years ago
shauli-ravfogel / nullspace_projection
View on GitHub
☆95Jun 6, 2022Updated 4 years ago
paul-rottger / xstest
View on GitHub
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
☆138Feb 24, 2025Updated last year
1andrevich / antifilter-domain
View on GitHub
Generated geosite.dat based on Antifilter Community List
☆29Updated this week
jaechan-repo / muse_bench
View on GitHub
☆33Aug 9, 2024Updated last year
MilaNLProc / honest
View on GitHub
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
☆21Apr 8, 2025Updated last year
sabithsn / APPDIA-Discourse-Style-Transfer
View on GitHub
Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…
☆13Sep 8, 2022Updated 3 years ago
microsoft / HiTab
View on GitHub
[ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.
☆109Dec 16, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
graldij / transformer-fusion
View on GitHub
Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.
☆31Apr 19, 2024Updated 2 years ago
rudinger / winogender-schemas
View on GitHub
Data for evaluating gender bias in coreference resolution systems.
☆82May 14, 2019Updated 7 years ago
xiaoleihuang / Multilingual_Fairness_LREC
View on GitHub
Data and code repository of " Multilingual Fairness Evaluation for Hate Speech Detection ". LREC 2020.
☆19Dec 8, 2022Updated 3 years ago
minnesotanlp / cobbler
View on GitHub
Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
☆23Feb 16, 2024Updated 2 years ago
sail-sg / closer-look-LLM-unlearning
View on GitHub
[ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models
☆49Dec 4, 2024Updated last year
PlusLabNLP / PredictiveEngagement
View on GitHub
Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems
☆16Jun 8, 2021Updated 5 years ago
centerforaisafety / wmdp
View on GitHub
WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…
☆176May 29, 2025Updated last year