GenEx-world / genex
Generative World Explorer
☆143Updated 5 months ago
Alternatives and similar repositories for genex:
Users that are interested in genex are comparing it to the libraries listed below
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 5 months ago
- ☆126Updated 4 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆96Updated 2 weeks ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated 8 months ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆120Updated last month
- [ArXiv 2025] WORLDMEM: Long-term Consistent World Simulation with Memory☆93Updated this week
- Aether: Geometric-Aware Unified World Modeling☆286Updated last month
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆69Updated 3 weeks ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆70Updated last week
- SceneFun3D ToolKit☆132Updated 2 weeks ago
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆252Updated 5 months ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆126Updated last month
- The official implementation of "CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)☆89Updated 3 months ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆155Updated last week
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆225Updated 7 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆83Updated 3 weeks ago
- Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆163Updated 3 weeks ago
- ☆265Updated 3 weeks ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆19Updated last month
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆79Updated last week
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆177Updated 3 weeks ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆95Updated 3 weeks ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆248Updated 6 months ago
- (CVPR 2025 Highlight) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆191Updated 2 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆98Updated this week
- Unifying 2D and 3D Vision-Language Understanding☆79Updated 3 weeks ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆305Updated 9 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", arXiv 2025.☆62Updated 2 weeks ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆128Updated last month
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆165Updated last year