apple / ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
☆3,847Updated 2 months ago
Alternatives and similar repositories for ml-depth-pro:
Users that are interested in ml-depth-pro are comparing it to the libraries listed below
- [NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation☆4,105Updated 4 months ago
- High-resolution models for human tasks.☆4,631Updated 3 weeks ago
- CoTracker is a model for tracking any point (pixel) on a video.☆3,986Updated last month
- [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation☆7,111Updated 5 months ago
- Grounding Image Matching in 3D with MASt3R☆1,412Updated 2 months ago
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆5,892Updated last week
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation☆2,444Updated this week
- DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos☆1,049Updated last week
- Official Implementation of CVPR24 highligt paper: Matching Anything by Segmenting Anything☆1,027Updated last month
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13,075Updated this week
- 4M: Massively Multimodal Masked Modeling☆1,638Updated 2 months ago
- Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!☆1,028Updated last month
- Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".☆4,082Updated last week
- Official Implementation of LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆523Updated last week
- The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foun…☆1,487Updated 2 weeks ago
- [3DV'25] 3D Reconstruction with Spatial Memory☆800Updated 2 weeks ago
- LightGlue: Local Feature Matching at Light Speed (ICCV 2023)☆3,494Updated 5 months ago
- DUSt3R: Geometric 3D Vision Made Easy☆5,518Updated 2 months ago
- Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.☆1,199Updated last week
- Efficient vision foundation models for high-resolution generation and perception.☆2,468Updated last week
- Cambrian-1 is a family of multimodal LLMs with a vision-centric design.☆1,799Updated last month
- [CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering☆2,308Updated last month
- Official repository for LTX-Video☆1,924Updated this week
- GLOMAP - Global Structured-from-Motion Revisited☆1,538Updated this week
- ☆844Updated 4 months ago
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2☆1,334Updated this week
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection☆4,814Updated last month
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S…☆1,633Updated last week
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"☆875Updated this week
- Tracking Any Point (TAP)☆1,343Updated 2 weeks ago