mbanani / probe3dLinks
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models
☆313Updated 11 months ago
Alternatives and similar repositories for probe3d
Users that are interested in probe3d are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆281Updated 3 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆120Updated last year
- SceneFun3D ToolKit☆142Updated 2 months ago
- Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"☆318Updated last year
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆126Updated 2 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆156Updated 3 weeks ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆156Updated 2 weeks ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆164Updated last month
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆167Updated last week
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆142Updated 7 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆102Updated 3 months ago
- Orient Anything, ICML 2025☆285Updated last month
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆170Updated last year
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆245Updated 3 months ago
- ☆83Updated 2 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆107Updated last month
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆273Updated 6 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos☆227Updated 8 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆89Updated 3 weeks ago
- ☆135Updated 6 months ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆124Updated this week
- [ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models☆127Updated 10 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆235Updated this week
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆339Updated 9 months ago
- Improving Semantic Correspondences with Viewpoint-Guided Spherical Maps (CVPR 2024)☆20Updated 6 months ago
- ☆163Updated 4 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model☆313Updated 11 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆153Updated 3 weeks ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆192Updated 2 months ago