yisuanwang / Idea23D
[COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
☆47Updated 3 months ago
Alternatives and similar repositories for Idea23D:
Users that are interested in Idea23D are comparing it to the libraries listed below
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆12Updated 7 months ago
- ☆35Updated last month
- ☆15Updated 4 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆70Updated last week
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆63Updated 3 weeks ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆46Updated 4 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", arXiv 2025.☆62Updated 3 weeks ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆30Updated 2 months ago
- An organized list of academic papers focused on the topic of 4D Generation. If you have any additions or suggestions, feel free to contri…☆56Updated last year
- A curated list of awesome 3D scene generation papers☆25Updated this week
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆90Updated 4 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated 8 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆42Updated last month
- ☆35Updated last month
- [ECCV 2024] GenRC: 3D Indoor Scene Generation from Sparse Image Collections☆27Updated 3 months ago
- Seeing World Dynamics in a Nutshell☆106Updated last month
- [NeurIPS'2024] Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly☆54Updated 5 months ago
- IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement☆15Updated 2 months ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆21Updated last week
- open-sourced video dataset with dynamic scenes and camera movements annotation☆50Updated 2 weeks ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆31Updated 2 months ago
- [CVPR 2025] Official code for the paper "SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis"☆75Updated last month
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆61Updated 6 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆147Updated 6 months ago
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆39Updated 7 months ago
- Official implementation of "Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning"☆13Updated 3 months ago
- BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation (AAAI 2025)☆14Updated 3 months ago
- ☆47Updated 3 weeks ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆47Updated last month
- Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025☆31Updated last month