Zhoues / MineDreamer
[NeurIPSw'24] This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"
☆86Updated 2 months ago
Alternatives and similar repositories for MineDreamer:
Users who are interested in MineDreamer are comparing it to the repositories listed below:
- [CVPR2024] This is the official implementation of MP5☆99Updated 9 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆36Updated last year
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆70Updated 3 weeks ago
- Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆92Updated 2 weeks ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆64Updated last year
- Official implementation of "Self-Improving Video Generation"☆62Updated last month
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆121Updated last month
- [ICML 2024] The official Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆78Updated 6 months ago
- Evaluate Multimodal LLMs as Embodied Agents☆39Updated last month
- RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints☆29Updated last week
- [ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.☆285Updated 10 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆28Updated 4 months ago
- Embodied Agent Interface (EAI): Benchmarking LLMs for Embodied Decision Making (NeurIPS D&B 2024 Oral)☆180Updated 3 weeks ago
- [NeurIPS-2024] The official Implementation of "Instruction-Guided Visual Masking"☆33Updated 4 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆81Updated last week
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆21Updated last year
- A collection of papers from the ongoing line of work that started with World Models.