microsoft / mineworld
MineWorld: A Real-time interactive world model on Minecraft
☆321Updated last week
Alternatives and similar repositories for mineworld
Users that are interested in mineworld are comparing it to the libraries listed below
Sorting:
- [ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos☆288Updated last month
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi…☆86Updated 4 months ago
- Dream 7B, a large diffusion language model☆630Updated 2 weeks ago
- ☆284Updated 3 weeks ago
- ☆151Updated this week
- An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆526Updated this week
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models☆595Updated last month
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆345Updated 3 weeks ago
- Official Implementation of Video-T1: Test-Time Scaling for Video Generation☆258Updated last month
- [ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory☆124Updated last week
- Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving stat…☆529Updated this week
- ☆313Updated this week
- Pandora: Towards General World Model with Natural Language Actions and Video States☆503Updated 7 months ago
- A Unified Tokenizer for Visual Generation and Understanding☆290Updated last week
- Official implementation of UnifiedReward & UnifiedReward-Think☆358Updated this week
- ☆211Updated last week
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆496Updated last week
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆239Updated 2 weeks ago
- ☆126Updated 4 months ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse"☆73Updated last month
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆194Updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆73Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆166Updated 3 weeks ago
- ☆76Updated last month
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant☆278Updated last month
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆581Updated 7 months ago
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life☆348Updated 5 months ago
- Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets☆307Updated this week
- Multimodal Models in Real World☆503Updated 2 months ago
- Liquid: Language Models are Scalable and Unified Multi-modal Generators☆573Updated last month