yisuanwang / Idea23DLinks
[COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
☆49Updated 4 months ago
Alternatives and similar repositories for Idea23D
Users that are interested in Idea23D are comparing it to the libraries listed below
Sorting:
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆12Updated 8 months ago
- ☆36Updated 2 months ago
- [NeurIPS'2024] Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly☆57Updated 6 months ago
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆67Updated last month
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated 9 months ago
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆92Updated 5 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆63Updated this week
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆36Updated 2 weeks ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆47Updated 5 months ago
- Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025☆36Updated 2 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆71Updated this week
- [CVPR2024] DiffInDScene: Diffusion-based High-Quality 3D Indoor Scene Generation☆87Updated 8 months ago
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆62Updated 7 months ago
- ☆17Updated 5 months ago
- ☆55Updated last month
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆42Updated 2 months ago
- ☆35Updated 2 months ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆50Updated 2 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆34Updated 3 months ago
- Seeing World Dynamics in a Nutshell☆109Updated 2 months ago
- ☆43Updated 5 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", arXiv 2025.☆62Updated last month
- ☆58Updated 2 months ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆31Updated 3 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆22Updated last month
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆65Updated 2 months ago
- [ECCV 2024] GenRC: 3D Indoor Scene Generation from Sparse Image Collections☆27Updated 4 months ago
- [NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation☆168Updated 8 months ago
- Official implementation of "Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning"☆14Updated 4 months ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆94Updated last year