BladeTransformerLLC / OvercookedGPT
An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic multi-agent settings.
☆64Updated last year
Alternatives and similar repositories for OvercookedGPT:
Users that are interested in OvercookedGPT are comparing it to the libraries listed below
- ☆140Updated 8 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆127Updated 9 months ago
- ☆76Updated 6 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆90Updated last year
- ☆13Updated 10 months ago
- ☆46Updated last month
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"☆45Updated last year
- ☆83Updated 7 months ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated last month
- ☆26Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆127Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆233Updated 5 months ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated 11 months ago
- ProAgent: Building Proactive Cooperative Agents with Large Language Models☆69Updated 9 months ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆108Updated last week
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆27Updated 3 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆209Updated 2 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆81Updated 5 months ago
- Implementation of TWOSOME☆62Updated 2 weeks ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆52Updated 3 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 7 months ago
- A benchmark for evaluating learning agents based on just language feedback☆64Updated 3 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆60Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆125Updated 10 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆100Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆33Updated 2 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆260Updated 7 months ago
- A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM☆66Updated 5 months ago
- Reasoning with Language Model is Planning with World Model☆156Updated last year