microsoft / mineworld
MineWorld: A Real-time interactive world model on Minecraft
☆162 · Updated last week
Alternatives and similar repositories for mineworld:
Users who are interested in mineworld are comparing it to the repositories listed below.
- [ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videos ☆281 · Updated last month
- ☆275 · Updated 2 months ago
- A Unified Tokenizer for Visual Generation and Understanding ☆256 · Updated last week
- ☆126 · Updated 3 months ago
- Official Implementation of Video-T1: Test-Time Scaling for Video Generation ☆246 · Updated 2 weeks ago
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi… ☆82 · Updated 3 months ago
- Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models ☆536 · Updated this week
- Official PyTorch Implementation of "History-Guided Video Diffusion" ☆264 · Updated last month
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆468 · Updated 2 weeks ago
- Official PyTorch implementation of TokenSet. ☆114 · Updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆71 · Updated last week
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing" ☆222 · Updated last month
- Implementation of a framework for Genie2 in Pytorch ☆145 · Updated 3 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini… ☆584 · Updated 3 weeks ago
- (ECCV 2024) Code for V-IRL: Grounding Virtual Intelligence in Real Life ☆343 · Updated 4 months ago
- Pandora: Towards General World Model with Natural Language Actions and Video States ☆502 · Updated 6 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆149 · Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student ☆66 · Updated last month
- Liquid: Language Models are Scalable and Unified Multi-modal Generators ☆517 · Updated 2 weeks ago
- Official Implementation of "JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse" ☆69 · Updated 3 weeks ago
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant ☆262 · Updated last month
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences ☆295 · Updated 8 months ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation ☆180 · Updated 2 months ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video … ☆118 · Updated 3 weeks ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆266 · Updated last month
- ☆94 · Updated 2 weeks ago
- A Video Tokenizer Evaluation Dataset ☆112 · Updated 3 months ago
- DDT: Decoupled Diffusion Transformer ☆184 · Updated last week
- WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens ☆197 · Updated last year
- ☆358 · Updated 6 months ago