GenEx-world / genexLinks
Generative World Explorer
☆158Updated 4 months ago
Alternatives and similar repositories for genex
Users that are interested in genex are comparing it to the libraries listed below
Sorting:
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆377Updated 2 weeks ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆159Updated 3 months ago
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆121Updated 3 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆253Updated this week
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆157Updated 3 weeks ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆81Updated 3 months ago
- (CVPR 2025 Highlight) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆241Updated 3 months ago
- ☆149Updated 9 months ago
- Official implementation of DepthLM☆229Updated 3 weeks ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆184Updated 2 weeks ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆244Updated this week
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆44Updated 4 months ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆296Updated 2 weeks ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆135Updated 3 months ago
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆85Updated 3 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆369Updated 4 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆186Updated 2 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆281Updated last month
- A list of works on video generation towards world model☆170Updated 2 weeks ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆346Updated last month
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆186Updated 6 months ago
- Unifying 2D and 3D Vision-Language Understanding☆115Updated 3 months ago
- Self-reimplemented version of 4D-LRM.☆60Updated 5 months ago
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"☆63Updated last week
- Orient Anything, ICML 2025☆339Updated 2 weeks ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆72Updated 6 months ago
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"☆165Updated 3 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆270Updated last week
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆511Updated this week
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆219Updated 6 months ago