ALEEEHU / World-Simulator
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repository for the latest updates! 🔥
☆253 · Updated last week
Alternatives and similar repositories for World-Simulator
Users who are interested in World-Simulator are comparing it to the repositories listed below.
- (CVPR 2025 Highlight) The Scene Language: Representing Scenes with Programs, Words, and Embeddings ☆205 · Updated 3 months ago
- [ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation" ☆267 · Updated 11 months ago
- "4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei ☆236 · Updated 11 months ago
- List of papers on 4D Generation. ☆274 · Updated 7 months ago
- Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis ☆2 · Updated 7 months ago
- [SGP 2025] OctFusion: Octree-based Diffusion Models for 3D Shape Generation ☆204 · Updated 2 weeks ago
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li… ☆300 · Updated 4 months ago
- V3D: Video Diffusion Models are Effective 3D Generators ☆479 · Updated last year
- The official implementation of work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment". ☆115 · Updated 8 months ago
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World ☆267 · Updated 6 months ago
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi… ☆88 · Updated 5 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence ☆179 · Updated last week
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆106 · Updated last month
- [CVPR2025] Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data ☆61 · Updated last month
- [Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos ☆325 · Updated 9 months ago
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024). ☆345 · Updated 2 months ago
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs. ☆183 · Updated last year
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, … ☆128 · Updated last month
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and… ☆163 · Updated last month
- [CVPR'24] Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors ☆170 · Updated last year
- [NeurIPS 2024] Animate3D: Animating Any 3D Model with Multi-view Video Diffusion ☆190 · Updated 7 months ago
- PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024) ☆299 · Updated 7 months ago
- [ECCV2024] DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling ☆151 · Updated 3 months ago
- [ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior". ☆118 · Updated 4 months ago
- ☆129 · Updated 5 months ago
- Code for PhysDreamer ☆561 · Updated 3 months ago
- [ICML 2024] GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting ☆298 · Updated 10 months ago
- The official implementation of The paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs" ☆52 · Updated 2 weeks ago
- [AAAI'2024] IT3D: Improved Text-to-3D Generation with Explicit View Synthesis ☆219 · Updated last year
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D ☆128 · Updated 2 months ago