cnsdqd-dyb / VillagerAgent-Minecraft-multiagent-framework
(VillagerAgent ACL 2024) A Graph based Minecraft multi agents framework
☆41Updated 3 weeks ago
Alternatives and similar repositories for VillagerAgent-Minecraft-multiagent-framework:
Users who are interested in VillagerAgent-Minecraft-multiagent-framework are comparing it to the repositories listed below
- [NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simu…☆80Updated 3 weeks ago
- [CVPR2024] This is the official implementation of MP5☆94Updated 7 months ago
- Official implementation of "Self-Improving Video Generation"☆59Updated last month
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆84Updated 3 months ago
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆62Updated last month
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆81Updated 5 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆36Updated last year
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆36Updated 2 weeks ago
- OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?☆26Updated 3 weeks ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆77Updated last week
- Latent Motion Token as the Bridging Language for Robot Manipulation☆71Updated last week
- Official implementation of WebVLN: Vision-and-Language Navigation on Websites☆28Updated last year
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆124Updated 3 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆115Updated last month
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆113Updated 3 weeks ago
- The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆43Updated last month
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]☆57Updated last week
- Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆61Updated 4 months ago
- Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆128Updated last month
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆109Updated 9 months ago
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆73Updated 3 weeks ago
- Official Repository of Multi-Object Hallucination in Vision-Language Models (NeurIPS 2024)☆26Updated 3 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆28Updated 3 weeks ago