cnsdqd-dyb / VillagerAgent-Minecraft-multiagent-framework
(VillagerAgent ACL 2024) A Graph based Minecraft multi agents framework
☆53Updated 2 months ago
Alternatives and similar repositories for VillagerAgent-Minecraft-multiagent-framework:
Users that are interested in VillagerAgent-Minecraft-multiagent-framework are comparing it to the libraries listed below
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆83Updated 2 months ago
- [CVPR2024] This is the official implement of MP5☆99Updated 8 months ago
- Official implementation of "Self-Improving Video Generation"☆62Updated 3 weeks ago
- ☆37Updated last month
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆64Updated last year
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆36Updated last year
- ☆121Updated 2 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆62Updated last week
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆70Updated 2 weeks ago
- Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆88Updated last week
- ☆44Updated last year
- ☆83Updated last month
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆28Updated 4 months ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"☆52Updated this week
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆127Updated last week
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆21Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆28Updated 2 weeks ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆44Updated 2 weeks ago
- Evaluate Multimodal LLMs as Embodied Agents☆38Updated last month
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆53Updated this week
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆88Updated last week
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆30Updated 7 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆121Updated 3 weeks ago
- ☆75Updated 7 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆64Updated 3 weeks ago
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆95Updated 5 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025)☆24Updated last week
- Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)☆60Updated 3 weeks ago
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆167Updated last week
- ☆67Updated 6 months ago