sethkarten / LLM-EconomistLinks
Official repository of the 2025 paper, LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra.
☆52Updated 4 months ago
Alternatives and similar repositories for LLM-Economist
Users that are interested in LLM-Economist are comparing it to the libraries listed below
Sorting:
- ☆88Updated 3 weeks ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆65Updated 9 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆62Updated 7 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Updated 4 months ago
- ☆33Updated 6 months ago
- ☆40Updated last year
- Implementation of SOAR☆43Updated 2 months ago
- ☆68Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆112Updated last month
- UQ: Assessing Language Models on Unsolved Questions☆28Updated 3 months ago
- ☆43Updated last year
- ☆73Updated 2 months ago
- ☆104Updated 5 months ago
- Collection of LLM completions for reasoning-gym task datasets☆30Updated 4 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 8 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 7 months ago
- ☆31Updated 4 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆66Updated 11 months ago
- ☆41Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 8 months ago
- ☆97Updated this week
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆54Updated last month
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆29Updated 7 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆106Updated this week
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 3 months ago
- ☆25Updated 6 months ago
- Open Source Replication of Anthropic's Alignment Faking Paper☆51Updated 7 months ago
- accompanying material for sleep-time compute paper☆117Updated 7 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 9 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆138Updated 7 months ago