Zhoues / MineDreamer
[NIPS24W]This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control "
☆73Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for MineDreamer
- [CVPR2024] This is the official implement of MP5☆84Updated 4 months ago
- ⛏💎 STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆30Updated 10 months ago
- ☆114Updated 4 months ago
- ☆61Updated last month
- ☆29Updated last week
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment☆17Updated 8 months ago
- Paper collections of the continuous effort start from World Models.☆142Updated 4 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆71Updated 3 weeks ago
- Official implementation of "Self-Improving Video Generation"☆52Updated last week
- HAZARD challenge☆26Updated 6 months ago
- ☆47Updated 2 months ago
- ☆40Updated 11 months ago
- Official Implementation of ReALFRED (ECCV'24)☆24Updated last month
- [Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …☆139Updated last month
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆208Updated this week
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆129Updated last month
- A minecraft multi agents framework☆35Updated this week
- Official implementation of WebVLN: Vision-and-Language Navigation on Websites☆24Updated 10 months ago
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆71Updated 3 months ago
- Codebase for HiP☆87Updated 11 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024)☆62Updated 3 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆47Updated last month
- ☆64Updated 4 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos☆56Updated 11 months ago
- Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision☆47Updated 4 months ago
- SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enh…☆26Updated 3 months ago
- ☆104Updated 2 weeks ago
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…☆48Updated 3 weeks ago
- ☆27Updated 5 months ago
- ☆68Updated 3 months ago