snshen / Learning-for-3D-VisionLinks
Assignments from 16-825 Learning for 3D Vision at Carnegie Mellon University
☆13Updated 2 years ago
Alternatives and similar repositories for Learning-for-3D-Vision
Users that are interested in Learning-for-3D-Vision are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction☆83Updated 6 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆528Updated 2 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆32Updated 11 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆427Updated last month
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆134Updated 10 months ago
- [ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆144Updated 2 weeks ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆267Updated last month
- Official implementation of Video-DPM☆164Updated 3 weeks ago
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆78Updated 2 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆195Updated 8 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆334Updated 5 months ago
- PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation☆357Updated last month
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆142Updated 6 months ago
- Cameras as Relative Positional Encoding☆671Updated last month
- A Comprehensive Survey on World Models for Embodied AI☆209Updated 3 months ago
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields☆489Updated 3 months ago
- Reasoning in Space via Grounding in the World☆46Updated 3 months ago
- SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis☆36Updated 7 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆53Updated 2 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆172Updated 7 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆126Updated 3 months ago
- ☆183Updated 6 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆28Updated 3 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆128Updated 11 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆69Updated 7 months ago
- This is the official implementation of Video Generation part of This&That: Language-Gesture Controlled Video Generation for Robot Plannin…☆48Updated last month
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…☆38Updated 2 weeks ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆167Updated 3 weeks ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆60Updated 10 months ago
- A curated list of awesome exploration policy papers.☆13Updated last month