Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
☆88 · Mar 2, 2021 · Updated 5 years ago
Alternatives and similar repositories for bold
Users that are interested in bold are comparing it to the libraries listed below.
- Dataset + classifier tools to study social perception biases in natural language generation ☆72 · Jun 12, 2023 · Updated 2 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. ☆156 · Aug 18, 2025 · Updated 9 months ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings) ☆21 · Jul 6, 2021 · Updated 4 years ago
- Repository for the Bias Benchmark for QA dataset. ☆142 · Jan 8, 2024 · Updated 2 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske… ☆135 · Mar 1, 2024 · Updated 2 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models ☆201 · Dec 8, 2022 · Updated 3 years ago
- Papers on fairness in NLP ☆453 · May 2, 2024 · Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs" ☆10 · Dec 13, 2024 · Updated last year
- Framework for controlling demographic biases in NLG (using adversarial prompts) ☆21 · Jun 12, 2023 · Updated 2 years ago
- Narrative Understanding Workshop paper (2021) on gender in GPT-3 generated stories ☆14 · May 28, 2021 · Updated 4 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP". ☆89 · Aug 20, 2021 · Updated 4 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models ☆17 · Jul 17, 2024 · Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024. ☆16 · Oct 28, 2024 · Updated last year
- Generated geosite.dat based on Antifilter Community List ☆27 · Updated this week
- ☆44 · Oct 1, 2024 · Updated last year
- ☆32 · Aug 9, 2024 · Updated last year
- ☆27 · Oct 6, 2024 · Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆85 · Dec 21, 2024 · Updated last year
- Code and data for the FACTOR paper ☆53 · Nov 15, 2023 · Updated 2 years ago
- Code for EMNLP 2020 paper: Analogous Process Structure Induction for Sub-event Sequence Prediction ☆11 · Oct 19, 2020 · Updated 5 years ago
- Repository for research in the field of Responsible NLP at Meta. ☆208 · Apr 18, 2026 · Updated last month
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety. ☆93 · May 9, 2024 · Updated 2 years ago
- Learning Gender-Neutral Word Embeddings ☆47 · Oct 3, 2019 · Updated 6 years ago
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning" ☆30 · Oct 1, 2024 · Updated last year
- ☆94 · Jun 6, 2022 · Updated 3 years ago
- Sensitive-rs is a Rust library for finding, validating, filtering, and replacing sensitive words. It provides efficient algorithms to han… ☆24 · May 11, 2026 · Updated last week
- Aligning AI With Shared Human Values (ICLR 2021) ☆320 · Apr 21, 2023 · Updated 3 years ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" ☆133 · Feb 24, 2025 · Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆21 · Apr 8, 2025 · Updated last year
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)… ☆13 · Sep 8, 2022 · Updated 3 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases". ☆16 · Aug 14, 2020 · Updated 5 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation. ☆109 · Dec 16, 2025 · Updated 5 months ago
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆19 · Dec 8, 2022 · Updated 3 years ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024. ☆31 · Apr 19, 2024 · Updated 2 years ago
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators" ☆23 · Feb 16, 2024 · Updated 2 years ago
- Converter for EN16931 invoices from CII to UBL ☆41 · May 8, 2026 · Updated last week
- TCM Lingdan LLM ☆52 · Nov 3, 2024 · Updated last year
- Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems ☆16 · Jun 8, 2021 · Updated 4 years ago
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models ☆49 · Dec 4, 2024 · Updated last year