yisuanwang / Idea23D
[COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
☆46Updated 2 months ago
Alternatives and similar repositories for Idea23D:
Users who are interested in Idea23D are comparing it to the repositories listed below:
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆12Updated 6 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆42Updated 4 months ago
- [NeurIPS'2024] Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly☆53Updated 4 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆62Updated 2 weeks ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆79Updated 7 months ago
- ☆15Updated 3 months ago
- ☆30Updated 2 weeks ago
- Seeing World Dynamics in a Nutshell☆102Updated 3 weeks ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", Arxiv 2025.☆41Updated last week
- An organized list of academic papers focused on the topic of 4D Generation. If you have any additions or suggestions, feel free to contri…☆56Updated last year
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction☆31Updated 2 months ago
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆58Updated last month
- Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆20Updated 4 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆22Updated last week
- Code for "Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views", CVPR 2025☆25Updated 2 weeks ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆29Updated last month
- [NeurIPS 2024] MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting☆99Updated last week
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆42Updated last month
- [ECCV 2024] GenRC: 3D Indoor Scene Generation from Sparse Image Collections☆26Updated 3 months ago
- ☆58Updated 2 months ago
- ☆40Updated 2 weeks ago
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Updated 8 months ago
- Official implementation of "Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning"☆12Updated 2 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆80Updated 3 weeks ago
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025 Highlight)☆52Updated last week
- [CVPR 2024 Highlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆61Updated 6 months ago
- Open-world 3D part segmentation of point clouds☆75Updated last month
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆86Updated last week
- [NeurIPS 2023] 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection☆49Updated last year
- ☆33Updated 3 weeks ago