Beckschen / genex
Generative World Explorer
☆138Updated 3 months ago
Alternatives and similar repositories for genex:
Users that are interested in genex are comparing it to the libraries listed below
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆229Updated 3 months ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆114Updated 4 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆96Updated 4 months ago
- SceneFun3D ToolKit☆125Updated last week
- (CVPR 2025) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆173Updated 3 weeks ago
- ☆121Updated 2 months ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆72Updated last week
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆219Updated 5 months ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆113Updated this week
- ☆251Updated 2 months ago
- GenXD: Generating Any 3D and 4D Scenes☆176Updated last month
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆237Updated 4 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆67Updated this week
- Unifying 2D and 3D Vision-Language Understanding☆41Updated this week
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆177Updated last year
- The official implementation of "CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)☆85Updated 2 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆163Updated 2 weeks ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆135Updated last week
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆105Updated last week
- Official PyTorch Implementation of "History-Guided Video Diffusion"☆233Updated 2 weeks ago
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆160Updated 10 months ago
- Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…☆240Updated this week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆38Updated 3 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆296Updated 8 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆77Updated 7 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆269Updated 3 weeks ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆90Updated 2 weeks ago
- [NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing☆106Updated 4 months ago
- ☆54Updated last month
- Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"☆115Updated 3 weeks ago