BladeTransformerLLC/OvercookedGPT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BladeTransformerLLC/OvercookedGPT)

BladeTransformerLLC / OvercookedGPT

An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic multi-agent settings.

☆73

Alternatives and similar repositories for OvercookedGPT

Users that are interested in OvercookedGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HumanCompatibleAI / overcooked-demo
View on GitHub
Web application where humans can play Overcooked with AI agents.
☆60Dec 6, 2022Updated 3 years ago
StephAO / HAHA
View on GitHub
Agents to play overcooked ai
☆15Jul 3, 2024Updated 2 years ago
HumanCompatibleAI / overcooked_ai
View on GitHub
A benchmark environment for fully cooperative human-AI performance.
☆988Mar 22, 2025Updated last year
limafang / agent-arxiv-daily
View on GitHub
🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)每日更新agent相关论文（已附带中文摘要翻译）
☆37Updated this week
mail-ecnu / Text-Gym-Agents
View on GitHub
This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…
☆22May 29, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rstrivedi / Melting-Pot-Contest-2023
View on GitHub
☆47May 21, 2024Updated 2 years ago
jidiai / GRF_MARL
View on GitHub
Google Research Football MARL Benchmark and Research Toolkit
☆61May 19, 2024Updated 2 years ago
icaros-usc / overcooked_env_gen
View on GitHub
Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.
☆16May 1, 2024Updated 2 years ago
etaoxing / kitchen-shift
View on GitHub
KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts
☆20Jun 21, 2022Updated 4 years ago
ReinholdM / Papers-of-Offline-RL
View on GitHub
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 4 years ago
Bluedotdot2021 / PRML-book_review
View on GitHub
PRML Page-by-page配套资料，对PRML全书及各章节的review
☆17Apr 16, 2024Updated 2 years ago
RickYang2016 / Self-Adaptive-Swarm-System_SASS_MRS2019
View on GitHub
Self-Adaptive_Swarm_System(SASS) for 2019 IEEE International Symposium on Multi-Robot and Multi-Agent Systems (MRS) Version. Paper: Self-…
☆29Mar 9, 2025Updated last year
facebookresearch / NeuralMemory
View on GitHub
A Data Source for Reasoning Embodied Agents
☆20Sep 18, 2023Updated 2 years ago
rosewang2008 / gym-cooking
View on GitHub
🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…
☆224Apr 25, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ishan00 / beating-atari-with-natural-language-guided-rl
View on GitHub
This repository is the implementation of the paper "Beating Atari with Natural Language Guided Reinforcement Learning"
☆12Nov 25, 2018Updated 7 years ago
e-puck2 / monitor
View on GitHub
Multiplatform monitor for e-puck2 robot. Qt project.
☆11Mar 9, 2021Updated 5 years ago
Stanford-ILIAD / Diverse-Conventions
View on GitHub
Exploring techniques to generate diverse conventions in multi-agent settings
☆16Nov 14, 2023Updated 2 years ago
manluo1 / ev-simulator
View on GitHub
☆11Sep 10, 2022Updated 3 years ago
Shanghai-Digital-Brain-Laboratory / DB-Football
View on GitHub
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
☆118Jan 16, 2024Updated 2 years ago
lich14 / CDS
View on GitHub
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
☆87Apr 3, 2023Updated 3 years ago
liming-vie / RUBER
View on GitHub
Implementation of RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems
☆18Jul 8, 2019Updated 7 years ago
UCSB-AI / llm_coordination
View on GitHub
Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…
☆46Oct 13, 2024Updated last year
noio / Domination-Game
View on GitHub
Competitive multi-agent game simulation environment for RL courses.
☆24Jun 4, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yangsizhe / MoVie
View on GitHub
[NeurIPS 2023] MoVie: Visual Model-Based Policy Adaptation for View Generalization
☆12Sep 22, 2023Updated 2 years ago
Fu-Dayuan / PreAct
View on GitHub
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆31Dec 12, 2024Updated last year
bic4907 / Overcooked-AI
View on GitHub
Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method
☆48Sep 11, 2024Updated last year
thunlp / Optima
View on GitHub
Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"
☆72Nov 14, 2024Updated last year
sjtu-marl / ZSC-Eval
View on GitHub
This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…
☆56Nov 22, 2025Updated 8 months ago
indylab / nxdo
View on GitHub
Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games
☆40Aug 27, 2021Updated 4 years ago
timjogorman / Multisentence-AMR-guidelines
View on GitHub
Guidelines for our secondary layer of annotation adding multi-sentence AMR links
☆12Sep 6, 2017Updated 8 years ago
MAS-anony / ASN
View on GitHub
☆34Dec 8, 2022Updated 3 years ago
lych1233 / GAMMA-human-ai-collaboration
View on GitHub
☆11Jan 13, 2026Updated 6 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
skypilot-org / skypilot-tutorial
View on GitHub
Tutorial to get started with SkyPilot!
☆59May 15, 2024Updated 2 years ago
lio-wong / llm-operators
View on GitHub
☆11Oct 29, 2024Updated last year
yalidu / liir
View on GitHub
Learning Individual Intrinsic Reward in MARL
☆65Dec 8, 2022Updated 3 years ago
jidiai / olympics_engine
View on GitHub
A simple 2D ball collision engine.
☆12Jun 15, 2023Updated 3 years ago
C-Claus / Basic-BIM-Checker-for-Autodesk-Revit
View on GitHub
A Revit Add-In to check for ProjectBasePoint, SharedBasePoint, Categories, Levels, Assembly Code and FireRating.
☆14Mar 7, 2020Updated 6 years ago
Amanda2024 / GCS_aamas337
View on GitHub
The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》
☆45Dec 31, 2021Updated 4 years ago
chang-github-00 / Predictive-Decoding
View on GitHub
Repo for Anonymous purpose, pls don't distribute
☆10Oct 2, 2024Updated last year