allenai / persona-biasLinks
☆27Updated last year
Alternatives and similar repositories for persona-bias
Users that are interested in persona-bias are comparing it to the libraries listed below
Sorting:
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆27Updated 5 months ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆27Updated last year
- ☆29Updated last year
- ☆32Updated last year
- Corpus to accompany: "Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest"☆58Updated 9 months ago
- [NAACL 2025] Towards Rationality in Language and Multimodal Agents: A Survey☆35Updated 10 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Updated 2 years ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Updated 2 years ago
- ☆49Updated 8 months ago
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆110Updated 2 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆123Updated last year
- This repository contains data, code and models for contextual noncompliance.☆24Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆57Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆132Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated 2 years ago
- ☆52Updated 8 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated last year
- ☆47Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆84Updated last year
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆61Updated last year
- SILO Language Models code repository☆83Updated last year
- Research code for "KAT: A Knowledge Augmented Transformer for Vision-and-Language"☆69Updated 3 years ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 11 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆48Updated 11 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆111Updated last year
- Supporting code for ReCEval paper☆30Updated last year
- Byte-sized text games for code generation tasks on virtual environments☆20Updated last year
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆14Updated last year
- m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks☆44Updated last year