yisuanwang / Idea23D
[COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
☆46Updated 2 months ago
Alternatives and similar repositories for Idea23D:
Users that are interested in Idea23D are comparing it to the libraries listed below
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆11Updated 5 months ago
- An organized list of academic papers focused on the topic of 4D Generation. If you have any additions or suggestions, feel free to contri…☆56Updated last year
- [CVPR 2024 Hightlight] Code release for "The More You See in 2D, the More You Perceive in 3D"☆60Updated 5 months ago
- ☆31Updated last week
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆21Updated last month
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆77Updated 7 months ago
- [ECCV 2024] GenRC: 3D Indoor Scene Generation from Sparse Image Collections☆26Updated 2 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆40Updated 3 months ago
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆56Updated last month
- [CVPR 2025] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆20Updated this week
- ☆14Updated 3 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆41Updated 2 weeks ago
- [NeurIPS'2024] Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly☆51Updated 3 months ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆28Updated 3 weeks ago
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆85Updated 3 months ago
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆38Updated 6 months ago
- Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆17Updated 3 months ago
- Seeing World Dynamics in a Nutshell☆98Updated last week
- Taming Video Diffusion Prior with Scene-Grounding Guidance for 3D Gaussian Splatting from Sparse Inputs (CVPR2025)☆43Updated 3 weeks ago
- [ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer☆94Updated 6 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆69Updated last week
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆29Updated this week
- This repository is the official implementation for the paper “REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices”.☆21Updated 7 months ago
- ☆38Updated 6 months ago
- Official implementation of "Motion Dreamer: Realizing Physically Coherent Video Generation through Scene-Aware Motion Reasoning"☆12Updated 2 months ago
- ☆55Updated last month
- ☆24Updated last month
- [CVPR 2025] Official code for the paper "SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis"☆66Updated last week
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆45Updated last week
- [CVPR 2025] TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting☆24Updated this week