S-Abdelnabi / LLM-Deliberation
Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation
☆26Updated 5 months ago
Alternatives and similar repositories for LLM-Deliberation:
Users that are interested in LLM-Deliberation are comparing it to the libraries listed below
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆141Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆69Updated 10 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆16Updated 6 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆103Updated last year
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆84Updated last month
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year
- Weak-to-Strong Jailbreaking on Large Language Models☆73Updated last year
- WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…☆111Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆74Updated last year
- ☆31Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- AbstainQA, ACL 2024☆25Updated 6 months ago
- ☆44Updated 7 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆53Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆44Updated last year
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆26Updated 3 weeks ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆73Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 5 months ago
- This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen,…☆48Updated 4 months ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆49Updated last year
- ☆42Updated last year
- This repository contains data, code and models for contextual noncompliance.☆21Updated 9 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆115Updated 11 months ago
- ☆63Updated 3 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆91Updated 11 months ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆85Updated 10 months ago
- Governance of the Commons Simulation (GovSim)☆46Updated 3 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆25Updated 4 months ago
- ☆17Updated 10 months ago