allenai / persona-biasLinks
☆27Updated last year
Alternatives and similar repositories for persona-bias
Users that are interested in persona-bias are comparing it to the libraries listed below
Sorting:
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Updated 6 months ago
- ☆32Updated last year
- ☆48Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated 2 years ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆27Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆85Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆132Updated last year
- ☆27Updated 10 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Updated 2 years ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆38Updated 2 years ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Updated last year
- Corpus to accompany: "Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest"☆59Updated 10 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆57Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆124Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆48Updated last year
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆49Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆96Updated 2 years ago
- This repository contains data, code and models for contextual noncompliance.☆24Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated last year
- ☆28Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆32Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- Official codes for EMNLP 2024 paper "Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models"☆37Updated last year
- SILO Language Models code repository☆83Updated last year
- ☆83Updated 2 years ago
- ☆36Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆52Updated 5 months ago
- Supporting code for ReCEval paper☆31Updated last year
- Byte-sized text games for code generation tasks on virtual environments☆20Updated last year