CraftJarvis / MCU
MCU: A Task-centric Framework for Open-ended Agent Evaluation in Minecraft
☆16 Updated last year
Related projects
Alternatives and complementary repositories for MCU
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos ☆56 Updated 10 months ago
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction" ☆42 Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆86 Updated last year
- ☆73 Updated 4 months ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction". ☆32 Updated 9 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control ☆49 Updated 2 months ago
- BASALT Benchmark datasets, evaluation code and agent training example. ☆19 Updated 11 months ago
- ☆27 Updated last week
- ☆11 Updated 7 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. … ☆121 Updated 7 months ago
- ☆40 Updated 11 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla… ☆77 Updated 3 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆38 Updated last month
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆24 Updated 2 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆94 Updated 3 weeks ago
- Repo to reproduce the First-Explore paper results ☆36 Updated last week
- ☆22 Updated 4 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆45 Updated 5 months ago
- ☆74 Updated 5 months ago
- Code for "Interactive Task Planning with Language Models" ☆25 Updated last year
- Official Repo of LangSuitE ☆78 Updated 2 months ago
- A benchmark for evaluating learning agents based on just language feedback ☆56 Updated last month
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen… ☆254 Updated last year
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning" ☆128 Updated 3 weeks ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆79 Updated this week
- Official implementation of Zero-Hero paper ☆12 Updated 3 months ago
- ☆113 Updated 4 months ago
- Official implementation of LoT paper: "Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic" ☆18 Updated 7 months ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆35 Updated last year
- Text world based on Minecraft rules. ☆11 Updated 5 months ago