facebookresearch / vggt
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆10,829 Updated 2 weeks ago
Alternatives and similar repositories for vggt
Users who are interested in vggt are comparing it to the libraries listed below.
- Grounding Image Matching in 3D with MASt3R ☆2,469 Updated 2 months ago
- CUDA accelerated rasterization of gaussian splatting ☆3,572 Updated 3 weeks ago
- [CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching ☆2,070 Updated last week
- [CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space ☆1,004 Updated last month
- Reference PyTorch implementation and models for DINOv3 ☆6,749 Updated last week
- VGGSfM: Visual Geometry Grounded Deep Structure From Motion ☆1,254 Updated 6 months ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision ☆1,687 Updated last month
- [CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass ☆1,239 Updated 4 months ago
- Depth Pro: Sharp Monocular Metric Depth in Less Than a Second. ☆4,813 Updated 4 months ago
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos ☆1,428 Updated last month
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation ☆2,932 Updated 4 months ago
- [CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors ☆2,516 Updated 6 months ago
- [SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields ☆2,776 Updated 2 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos ☆1,377 Updated last week
- [NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation ☆6,456 Updated 7 months ago
- Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources ☆1,946 Updated last month
- SpatialLM: Training Large Language Models for Structured Indoor Modeling ☆3,942 Updated 2 weeks ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 ☆2,766 Updated this week
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun… ☆1,918 Updated 6 months ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod… ☆8,394 Updated 3 months ago
- CoTracker is a model for tracking any point (pixel) on a video. ☆4,553 Updated 7 months ago
- [CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering ☆3,034 Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Qu… ☆2,880 Updated 11 months ago
- Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in th… ☆7,662 Updated 2 weeks ago
- [CVPR 2025] Prompt Depth Anything ☆912 Updated 2 weeks ago
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion" ☆1,266 Updated 3 months ago
- 3D Gaussian papers, continuously updated; discussion welcome. ☆2,258 Updated last month
- OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24] ☆1,321 Updated 3 months ago
- Web-based 3D visualization + Python ☆1,864 Updated this week
- 🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos ☆1,249 Updated last week