snshen / Learning-for-3D-VisionLinks
Assignments from 16-825 Learning for 3D Vision at Carnegie Mellon University
☆11Updated 2 years ago
Alternatives and similar repositories for Learning-for-3D-Vision
Users that are interested in Learning-for-3D-Vision are comparing it to the libraries listed below
Sorting:
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆199Updated 4 months ago
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction☆70Updated last month
- Official code for the CVPR 2025 paper "Navigation World Models".☆374Updated 3 weeks ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆132Updated last month
- ☆55Updated 2 weeks ago
- List of papers on 4D Generation.☆294Updated 10 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆125Updated 5 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆165Updated 2 months ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆26Updated 2 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆252Updated last month
- Cameras as Relative Positional Encoding☆557Updated last week
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆80Updated 2 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆115Updated this week
- SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis☆35Updated 2 months ago
- [ICCV 2025] Aether: Geometric-Aware Unified World Modeling☆462Updated last month
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆322Updated last year
- ☆111Updated last year
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 5 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆162Updated last month
- List of papers on video-centric robot learning☆21Updated 9 months ago
- SceneFun3D ToolKit☆155Updated 4 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆23Updated last month
- ☆169Updated last month
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆14Updated 11 months ago
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆79Updated last year
- A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)☆282Updated 2 weeks ago
- The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)☆77Updated 3 weeks ago
- Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment☆27Updated last month
- Generative World Explorer☆154Updated 2 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆339Updated 2 months ago