microsoft / Alympics
☆50Updated 8 months ago
Alternatives and similar repositories for Alympics:
Users who are interested in Alympics are comparing it to the libraries listed below:
- An OpenAI gym environment to evaluate the ability of LLMs (e.g., GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆64Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆26Updated 7 months ago
- A benchmark for evaluating learning agents based on just language feedback☆66Updated 4 months ago
- ☆213Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆214Updated 3 months ago
- This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"☆91Updated last week
- ProAgent: Building Proactive Cooperative Agents with Large Language Models☆72Updated 10 months ago
- ☆79Updated 7 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆109Updated 8 months ago
- An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆69Updated 5 months ago
- ☆141Updated 9 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆50Updated this week
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆131Updated 10 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆102Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆110Updated 9 months ago
- ☆83Updated 8 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆82Updated 4 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆244Updated 5 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆64Updated 7 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆242Updated 4 months ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated last year
- ☆54Updated 7 months ago
- ☆65Updated 10 months ago
- ☆46Updated 9 months ago
- An extensible benchmark for evaluating large language models on planning☆323Updated 9 months ago
- WarAgent: LLM-based Multi-Agent Simulation of World Wars☆244Updated 11 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆43Updated last year
- ☆50Updated 9 months ago
- Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source code for the paper "Embodied LLM Agents Lear…☆39Updated 6 months ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆48Updated 4 months ago