microsoft / Alympics
☆43Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for Alympics
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆62Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆98Updated 5 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆195Updated last week
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆32Updated 9 months ago
- A benchmark for evaluating learning agents based on just language feedback☆56Updated last month
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆121Updated 7 months ago
- ☆73Updated 4 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆49Updated 2 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆92Updated last year
- ☆135Updated 6 months ago
- ☆202Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆214Updated 3 weeks ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆40Updated 9 months ago
- An extensible benchmark for evaluating large language models on planning☆289Updated 5 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆219Updated 2 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆77Updated 3 months ago
- A repository for transformer critique learning and generation☆85Updated 11 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆17Updated 4 months ago
- An implementations of "Generative Agents: Interactive Simulacra of Human Behavior".☆86Updated last year
- ☆74Updated 5 months ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆82Updated last month
- Reasoning with Language Model is Planning with World Model☆145Updated last year
- ☆121Updated 9 months ago
- WarAgent: LLM-based Multi-Agent Simulation of World Wars☆196Updated 8 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆49Updated 2 months ago
- ☆24Updated 2 months ago
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆63Updated 2 months ago
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆273Updated 2 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆118Updated last year
- ☆77Updated last year