giorgiopiatti / GovSim
Governance of the Commons Simulation (GovSim)
☆44 · Updated 2 months ago
Alternatives and similar repositories for GovSim:
Users who are interested in GovSim are comparing it to the repositories listed below.
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆103 · Updated last year
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆80 · Updated this week
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆67 · Updated 9 months ago
- ☆68 · Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆62 · Updated this week
- Evaluating the Moral Beliefs Encoded in LLMs ☆24 · Updated 3 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆61 · Updated last month
- ☆23 · Updated last month
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆73 · Updated last year
- ☆20 · Updated 10 months ago
- ☆130 · Updated 4 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆61 · Updated 10 months ago
- Learning to route instances for Human vs AI Feedback ☆21 · Updated last month
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment ☆38 · Updated last year
- Official Implementation of "DeLLMa: Decision Making Under Uncertainty with Large Language Models" ☆50 · Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 · Updated last year
- Knowledge Unlearning for Large Language Models ☆22 · Updated 3 weeks ago
- Code/data for MARG (multi-agent review generation) ☆41 · Updated 4 months ago
- Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiatio… ☆26 · Updated 4 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆41 · Updated 8 months ago
- NeurIPS 2024 tutorial on LLM Inference ☆39 · Updated 3 months ago
- General-Sum variant of the game Diplomacy for evaluating AIs. ☆28 · Updated 11 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight) ☆204 · Updated this week
- The Prism Alignment Project ☆70 · Updated 11 months ago
- ☆64 · Updated 2 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆97 · Updated last year
- ☆15 · Updated last month
- ☆49 · Updated 7 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces ☆91 · Updated last year
- ☆55 · Updated this week