apple / ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
☆4,660 Updated 2 months ago
Alternatives and similar repositories for ml-depth-pro
Users who are interested in ml-depth-pro are comparing it to the repositories listed below.
- [NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation ☆6,036 Updated 5 months ago
- CoTracker is a model for tracking any point (pixel) on a video. ☆4,453 Updated 5 months ago
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation ☆2,844 Updated 2 months ago
- High-resolution models for human tasks. ☆5,077 Updated 8 months ago
- [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation ☆7,641 Updated last year
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory" ☆6,881 Updated 4 months ago
- [CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos ☆1,180 Updated 2 weeks ago
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun… ☆1,836 Updated 4 months ago
- Grounding Image Matching in 3D with MASt3R ☆2,358 Updated 2 weeks ago
- [CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors ☆2,354 Updated 4 months ago
- Metric depth estimation from a single image ☆2,649 Updated 2 months ago
- Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything ☆1,319 Updated 2 months ago
- [CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching ☆1,885 Updated last week
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 ☆2,470 Updated last month
- [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos ☆1,376 Updated 3 months ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision ☆1,370 Updated last week
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator" ☆616 Updated 2 months ago
- Tracking Any Point (TAP) ☆1,579 Updated last month
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆16,223 Updated 6 months ago
- ☆2,225 Updated last year
- DUSt3R: Geometric 3D Vision Made Easy ☆6,499 Updated 2 weeks ago
- Universal Monocular Metric Depth Estimation ☆953 Updated 2 months ago
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction ☆701 Updated 3 months ago
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection ☆5,690 Updated 4 months ago
- DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding ☆1,137 Updated 3 weeks ago
- 4M: Massively Multimodal Masked Modeling ☆1,748 Updated last month
- 3D Gaussian Splat Editor ☆2,507 Updated last week
- Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024 ☆1,545 Updated last year
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos ☆8,048 Updated last month
- [CVPR 2025] RollingDepth: Video Depth without Video Models ☆562 Updated 4 months ago