Zhoues / MineDreamer
This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control "
☆71Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for MineDreamer
- [CVPR2024] This is the official implement of MP5☆83Updated 4 months ago
- ⛏💎 STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆28Updated 10 months ago
- ☆58Updated last month
- ☆26Updated this week
- ☆113Updated 3 months ago
- ☆40Updated 10 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆17Updated 7 months ago
- Paper collections of the continuous effort start from World Models.☆130Updated 4 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆25Updated 2 months ago
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆132Updated 3 weeks ago
- ☆73Updated this week
- Official implementation of "Self-Improving Video Generation"☆47Updated this week
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆68Updated 2 months ago
- HAZARD challenge☆26Updated 6 months ago
- Official Implementation of ReALFRED (ECCV'24)☆24Updated 3 weeks ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆68Updated last week
- Official implementation of WebVLN: Vision-and-Language Navigation on Websites☆23Updated 10 months ago
- DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆55Updated 2 weeks ago
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"☆29Updated last month
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆67Updated last month
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…☆46Updated last week
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆47Updated last month
- Official Repository of Multi-Object Hallucination in Vision-Language Models (NeurIPS 2024)☆24Updated last month
- ☆64Updated 2 weeks ago
- Text world based on Minecraft rules.☆11Updated 5 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆56Updated 10 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆199Updated last month
- A minecraft multi agents framework☆33Updated this week
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆100Updated 7 months ago
- ☆64Updated 4 months ago