zhangquanchen / 3DThinker
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
☆68 Updated this week
Alternatives and similar repositories for 3DThinker
Users who are interested in 3DThinker are comparing it to the repositories listed below
- This is the repository that contains source code for the PhysGen3D. ☆231 Updated 2 months ago
- ☆140 Updated 8 months ago
- Are Video Models Ready as Zero-shot Reasoners? ☆77 Updated this week
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning ☆132 Updated this week
- [NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video ☆248 Updated this week
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy ☆789 Updated last month
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion ☆134 Updated last year
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation" ☆174 Updated 3 months ago
- A Unified Driving World Model for Future Generation and Perception ☆126 Updated 4 months ago
- [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding ☆510 Updated last month
- 🌐 3D and 4D World Modeling: A Survey ☆646 Updated last month
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI ☆103 Updated this week
- Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model ☆893 Updated last week
- [NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations ☆423 Updated 2 weeks ago
- RynnEC: Bringing MLLMs into Embodied World ☆380 Updated last month
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model ☆170 Updated this week
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++ ☆204 Updated 4 months ago
- (ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models. ☆73 Updated 5 months ago
- [SIGGRAPH Conference 2024] GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis ☆154 Updated 7 months ago
- Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation ☆347 Updated this week
- An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought" ☆101 Updated 2 months ago
- [ECCV2024] DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling ☆223 Updated 4 months ago
- Official Repository of OmniCaptioner ☆166 Updated 7 months ago
- Official implementation of "Next-Scale Autoregressive Models are Zero-Shot Single-Image Object View Synthesizers" ☆46 Updated 8 months ago
- [NeurIPS 2025] Efficient Reasoning Vision Language Models ☆419 Updated 2 months ago
- GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation ☆495 Updated last year
- The official PyTorch implementation of Diffusion Time-step Curriculum for One Image to 3D Generation (CVPR 2024) ☆74 Updated last year
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting. ☆101 Updated 8 months ago
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE ☆1,060 Updated last month
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation" ☆124 Updated last month