mukobi / welfare-diplomacyLinks
General-Sum variant of the game Diplomacy for evaluating AIs.
☆29Updated last year
Alternatives and similar repositories for welfare-diplomacy
Users that are interested in welfare-diplomacy are comparing it to the libraries listed below
Sorting:
- ☆137Updated 8 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆82Updated last year
- Governance of the Commons Simulation (GovSim)☆55Updated 5 months ago
- Interpreting how transformers simulate agents performing RL tasks☆87Updated last year
- ☆98Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆275Updated this week
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆112Updated last year
- ☆69Updated last month
- Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiatio…☆34Updated 8 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆118Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆234Updated 8 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆95Updated last year
- ☆206Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆110Updated 3 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆181Updated 3 months ago
- Reasoning with Language Model is Planning with World Model☆168Updated last year
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- Algebraic value editing in pretrained language models☆65Updated last year
- An extensible benchmark for evaluating large language models on planning☆386Updated 3 weeks ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- ☆121Updated 11 months ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆183Updated 2 years ago
- ☆283Updated last year
- ☆28Updated last year
- ☆27Updated 2 years ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆118Updated last month
- A library for efficient patching and automatic circuit discovery.☆70Updated 2 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆206Updated 2 years ago
- ☆87Updated 11 months ago
- Aligning AI With Shared Human Values (ICLR 2021)☆289Updated 2 years ago