cnsdqd-dyb / VillagerAgent-Minecraft-multiagent-framework
(VillagerAgent ACL 2024) A Graph based Minecraft multi agents framework
☆40Updated this week
Alternatives and similar repositories for VillagerAgent-Minecraft-multiagent-framework:
Users that are interested in VillagerAgent-Minecraft-multiagent-framework are comparing it to the libraries listed below
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆77Updated 6 months ago
- Official implementation of "Self-Improving Video Generation"☆57Updated 3 weeks ago
- [CVPR2024] This is the official implement of MP5☆92Updated 6 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆35Updated last year
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆78Updated 4 months ago
- ☆65Updated last month
- Latent Motion Token as the Bridging Language for Robot Manipulation☆65Updated last month
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆78Updated 2 months ago
- ☆56Updated 4 months ago
- ☆44Updated last year
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆50Updated this week
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆96Updated 2 weeks ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆70Updated 3 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆54Updated 2 months ago
- ☆43Updated 9 months ago
- ☆99Updated last week
- The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆41Updated 2 weeks ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆105Updated 8 months ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆124Updated 2 months ago
- ☆69Updated 4 months ago
- Official implementation of WebVLN: Vision-and-Language Navigation on Websites☆27Updated last year
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi…☆63Updated last week
- Egocentric Video Understanding Dataset (EVUD)☆24Updated 6 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆21Updated 10 months ago
- ☆123Updated 6 months ago
- ☆27Updated 6 months ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆35Updated last month
- 🔥 Aurora Series: A more efficient multimodal large language model series for video.☆62Updated 2 months ago
- ☆34Updated last week
- Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆60Updated 3 months ago