Beckschen / genex

Generative World Explorer

☆138

Alternatives and similar repositories for genex:

Users who are interested in genex are comparing it to the repositories listed below:

ZCMax / LLaVA-3D
A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆229 Updated 3 months ago
Junyi42 / GeoAware-SC
Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"
☆114 Updated 4 months ago
JeffWang987 / EgoVid
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
☆96 Updated 4 months ago
SceneFun3D / scenefun3d
SceneFun3D ToolKit
☆125 Updated last week
zzyunzhi / scene-language
(CVPR 2025) The Scene Language: Representing Scenes with Programs, Words, and Embeddings
☆173 Updated 3 weeks ago
phyworld / phyworld
☆121 Updated 2 months ago
zrporz / 4DLangSplat
Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)
☆72 Updated last week
dreamscene4d / dreamscene4d
[NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos
☆219 Updated 5 months ago
ZGCTroy / CamI2V
Official repo for the paper "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"
☆113 Updated this week
SpatialVision / Orient-Anything
☆251 Updated 2 months ago
HeliosZhao / GenXD
GenXD: Generating Any 3D and 4D Scenes
☆176 Updated last month
basilevh / gcd
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
☆237 Updated 4 months ago
abdo-eldesokey / build-a-scene
Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)
☆67 Updated this week
facebookresearch / univlg
Unifying 2D and 3D Vision-Language Understanding
☆41 Updated this week
GGGHSL / GraphDreamer
[CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.
☆177 Updated last year
hzxie / CityDreamer4D
The official implementation of "CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)
☆85 Updated 2 months ago
NVlabs / LSM
[NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D
☆163 Updated 2 weeks ago
HaoyiZhu / SPA
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆135 Updated last week
Jingkang50 / PSG4D
4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)
☆105 Updated last week
kwsong0113 / diffusion-forcing-transformer
Official PyTorch Implementation of "History-Guided Video Diffusion"
☆233 Updated 2 weeks ago
cvlab-columbia / pix2gestalt
Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)
☆160 Updated 10 months ago
nvidia-cosmos / cosmos-transfer1
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…
☆240 Updated this week
YunzeMan / Situation3D
[CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning
☆38 Updated 3 months ago
mbanani / probe3d
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models
☆296 Updated 8 months ago
VITA-Group / Comp4D
"Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…
☆77 Updated 7 months ago
ywyue / FiT3D
[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
☆269 Updated 3 weeks ago
mohammadasim98 / met3r
MEt3R: Measuring Multi-View Consistency in Generated Images
☆90 Updated 2 weeks ago
ewrfcas / MVInpainter
[NeurIPS 2024] MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing
☆106 Updated 4 months ago
MattWallingford / 360-1M
☆54 Updated last month
cyw-3d / SAR3D
Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"
☆115 Updated 3 weeks ago