paul-rottger / issuebenchLinks
Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance"
☆16Updated 6 months ago
Alternatives and similar repositories for issuebench
Users that are interested in issuebench are comparing it to the libraries listed below
Sorting:
- ☆139Updated 2 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Updated 2 years ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models"☆55Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 3 years ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Updated 2 years ago
- A corpus and code for understanding norms and subjectivity. 🤖☆53Updated last year
- Highlight errors in a bib file: missing URLs, capitalization protection, etc☆27Updated last year
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443☆86Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆99Updated 4 years ago
- ☆48Updated 2 years ago
- ☆37Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆24Updated 2 years ago
- Data for evaluating gender bias in coreference resolution systems.☆80Updated 6 years ago
- ☆117Updated last year
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆21Updated 9 months ago
- ☆53Updated last year
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…☆22Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆51Updated last year
- ☆24Updated last year
- Replication code for "With Little Power Comes Great Responsibility"☆39Updated 5 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆43Updated 2 years ago
- The Prism Alignment Project☆89Updated last year
- ☆23Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Updated 3 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆40Updated 3 years ago
- Code for the paper "Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias"☆83Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Updated 3 years ago
- ☆35Updated 4 years ago