myracheng / markedpersonas
Code and data for Marked Personas (ACL 2023)
☆22Updated last year
Alternatives and similar repositories for markedpersonas:
Users who are interested in markedpersonas are comparing it to the repositories listed below
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆71Updated 3 years ago
- Repository for the Bias Benchmark for QA dataset.☆101Updated last year
- Code and data for Koo et al.'s ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆18Updated last year
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆20Updated 3 years ago
- ☆38Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆131Updated 2 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆12Updated last year
- ☆47Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆79Updated 5 months ago
- ☆124Updated last year
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…☆111Updated 11 months ago
- ☆104Updated 9 months ago
- ☆16Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆57Updated last year
- ☆53Updated 2 months ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆28Updated 3 months ago
- ☆60Updated last month
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated last year
- Codebase, data and models for the SummaC paper in TACL☆87Updated 3 weeks ago
- Awesome LLM for NLG Evaluation Papers☆23Updated last year
- Official repository of "HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning", Findings of EMNLP 2023☆21Updated last year
- ☆25Updated 2 years ago
- ☆10Updated 2 years ago
- ☆25Updated 5 months ago
- The official repo for SocKET: Social Knowledge Evaluation Tests☆23Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆62Updated 3 months ago
- ☆12Updated 2 years ago
- Repository for research in the field of Responsible NLP at Meta.☆196Updated 2 months ago
- Text generation using language models with multiple exit heads☆15Updated 2 weeks ago