mukobi / welfare-diplomacyLinks
General-Sum variant of the game Diplomacy for evaluating AIs.
☆34Updated last year
Alternatives and similar repositories for welfare-diplomacy
Users that are interested in welfare-diplomacy are comparing it to the libraries listed below
Sorting:
- ☆144Updated 6 months ago
- Governance of the Commons Simulation (GovSim)☆64Updated last year
- ☆110Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆101Updated 2 years ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆244Updated last month
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆124Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆333Updated last month
- Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiatio…☆44Updated last year
- Interpreting how transformers simulate agents performing RL tasks☆90Updated 2 years ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆39Updated last year
- ☆216Updated 2 years ago
- An extensible benchmark for evaluating large language models on planning☆445Updated 4 months ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆138Updated 8 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆277Updated last week
- Algebraic value editing in pretrained language models☆67Updated 2 years ago
- ☆137Updated last year
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆137Updated last year
- ☆328Updated last year
- Formal Contracts for Multi-Agent Reinforcement Learning☆19Updated 2 years ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆126Updated 10 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆100Updated 2 years ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆199Updated 10 months ago
- ☆119Updated last year
- ☆65Updated last week
- ☆32Updated 2 years ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆202Updated 9 months ago
- Reasoning with Language Model is Planning with World Model☆185Updated 2 years ago
- Measuring the situational awareness of language models☆40Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆503Updated 9 months ago
- ☆144Updated last year