Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
☆87Mar 2, 2021Updated 5 years ago
Alternatives and similar repositories for bold
Users that are interested in bold are comparing it to the libraries listed below
Sorting:
- Dataset + classifier tools to study social perception biases in natural language generation☆71Jun 12, 2023Updated 2 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆154Aug 18, 2025Updated 7 months ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆21Jul 6, 2021Updated 4 years ago
- Repository for the Bias Benchmark for QA dataset.☆139Jan 8, 2024Updated 2 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆130Mar 1, 2024Updated 2 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆200Dec 8, 2022Updated 3 years ago
- Papers on fairness in NLP☆452May 2, 2024Updated last year
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 2 years ago
- Narrative Understanding Workshop paper (2021) on gender in GPT-3 generated stories☆14May 28, 2021Updated 4 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆89Aug 20, 2021Updated 4 years ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.☆344Jun 17, 2024Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- Butler 是一 个用于自动化服务管理和任务调度的工具项目。☆16Mar 11, 2026Updated last week
- ☆19Jun 21, 2025Updated 9 months ago
- ☆13Jun 25, 2025Updated 8 months ago
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆19Dec 16, 2024Updated last year
- Generated geosite.dat based on Antifilter Community List☆25Updated this week
- ☆44Oct 1, 2024Updated last year
- ☆32Aug 9, 2024Updated last year
- ☆230Feb 23, 2021Updated 5 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆84Dec 21, 2024Updated last year
- Code and data for the FACTOR paper☆53Nov 15, 2023Updated 2 years ago
- Repository for research in the field of Responsible NLP at Meta.☆204Feb 20, 2026Updated last month
- [ACL 2020] Towards Debiasing Sentence Representations☆63Nov 21, 2022Updated 3 years ago
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆29Oct 1, 2024Updated last year
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆93May 9, 2024Updated last year
- Learning Gender-Neutral Word Embeddings☆47Oct 3, 2019Updated 6 years ago
- Aligning AI With Shared Human Values (ICLR 2021)☆316Apr 21, 2023Updated 2 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)…☆13Sep 8, 2022Updated 3 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆107Dec 16, 2025Updated 3 months ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago
- Data for evaluating gender bias in coreference resolution systems.☆81May 14, 2019Updated 6 years ago
- Data and code repository of " Multilingual Fairness Evaluation for Hate Speech Detection ". LREC 2020.☆19Dec 8, 2022Updated 3 years ago
- TCM Lingdan LLM☆47Nov 3, 2024Updated last year
- Converter for EN16931 invoices from CII to UBL☆40Updated this week
- Code for Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems☆16Jun 8, 2021Updated 4 years ago
- [ICLR 2025] A Closer Look at Machine Unlearning for Large Language Models☆46Dec 4, 2024Updated last year
- WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…☆160May 29, 2025Updated 9 months ago