facebookresearch/DepthLM_Official

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/DepthLM_Official)

facebookresearch / DepthLM_Official

[ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM

☆362

Alternatives and similar repositories for DepthLM_Official

Users that are interested in DepthLM_Official are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

VITA-Group / VLM-3R
View on GitHub
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆428Updated this week
facebookresearch / VLM3
View on GitHub
Official implementation of paper "VLM³: Vision Language Models Are Native 3D Learners".
☆398Jun 1, 2026Updated last month
InternRobotics / G2VLM
View on GitHub
[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
☆345Apr 18, 2026Updated 3 months ago
yyfz / Pi3
View on GitHub
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,072Jul 3, 2026Updated 2 weeks ago
THU-SI / Spatial-MLLM
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆479Feb 5, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
CUT3R / CUT3R
View on GitHub
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,464Aug 27, 2025Updated 10 months ago
LaVi-Lab / VG-LLM
View on GitHub
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆245Nov 28, 2025Updated 7 months ago
YkiWu / Point3R
View on GitHub
[NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
☆191Mar 10, 2026Updated 4 months ago
Inception3D / TTT3R
View on GitHub
[ICLR 2026] A simple state update rule to enhance length generalization for CUT3R
☆683May 11, 2026Updated 2 months ago
wzzheng / StreamVGGT
View on GitHub
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆941Oct 27, 2025Updated 8 months ago
henry123-boy / SpaTrackerV2
View on GitHub
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984Feb 27, 2026Updated 4 months ago
Yangr116 / VST
View on GitHub
[ECCV2026] Visual Spatial Tuning
☆198Mar 25, 2026Updated 3 months ago
liruilong940607 / prope
View on GitHub
Cameras as Relative Positional Encoding
☆739Dec 18, 2025Updated 7 months ago
NJU-3DV / SpatialVID
View on GitHub
[CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
☆585Apr 22, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Davidyao99 / uni4d
View on GitHub
[CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
☆225May 25, 2025Updated last year
nv-tlabs / vipe
View on GitHub
ViPE: Video Pose Engine for Geometric 3D Perception
☆2,046Jun 9, 2026Updated last month
hanxunyu / DepthVLM
View on GitHub
🔥 Official code repository for "Unlocking Dense Metric Depth Estimation in VLMs"
☆154May 21, 2026Updated 2 months ago
facebookresearch / map-anything
View on GitHub
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,569Updated this week
gangweix / pixel-perfect-depth
View on GitHub
[NeurIPS 2025] Pixel-Perfect Depth
☆1,059Feb 13, 2026Updated 5 months ago
microsoft / MoGe
View on GitHub
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
☆2,648Nov 2, 2025Updated 8 months ago
mega-sam / mega-sam
View on GitHub
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
☆1,336Jan 5, 2026Updated 6 months ago
cambrian-mllm / cambrian-s
View on GitHub
Cambrian-S: Towards Spatial Supersensing in Video
☆560Apr 3, 2026Updated 3 months ago
Junyi42 / monst3r
View on GitHub
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
☆1,381Jun 16, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yangzhou24 / OmniWorld
View on GitHub
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485Apr 16, 2026Updated 3 months ago
WU-CVGL / SIU3R
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…
☆163Sep 25, 2025Updated 9 months ago
ByteDance-Seed / TraceAnything
View on GitHub
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆542Oct 31, 2025Updated 8 months ago
xiac20 / SimRecon
View on GitHub
[CVPR'26 Highlight] SimRecon: SimReady Compositional Scene Reconstruction from Real Videos
☆132Apr 14, 2026Updated 3 months ago
QitaoZhao / E-RayZer
View on GitHub
[CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.
☆300May 30, 2026Updated last month
lifuguan / IGGT_official
View on GitHub
[ICLR'26] IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction
☆425Dec 1, 2025Updated 7 months ago
metric-anything / metric-anything
View on GitHub
Accepted to ECCV 2026
☆338Jul 6, 2026Updated 2 weeks ago
hwjiang1510 / MegaSynth
View on GitHub
Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)
☆205May 20, 2025Updated last year
facebookresearch / lagernvs
View on GitHub
Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)
☆402Jun 26, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SunYangtian / UniGeo
View on GitHub
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
☆136Jun 10, 2025Updated last year
chenguolin / MoVieS
View on GitHub
[CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".
☆460Mar 19, 2026Updated 4 months ago
ZCMax / LLaVA-3D
View on GitHub
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆384Oct 21, 2025Updated 9 months ago
Any-4D / Any4D
View on GitHub
Any4D: Unified Feed-Forward Metric 4D Reconstruction
☆382Apr 17, 2026Updated 3 months ago
Inception3D / Easi3R
View on GitHub
[ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.
☆532Apr 1, 2025Updated last year
ByteDance-Seed / Depth-Anything-3
View on GitHub
Depth Anything 3
☆5,917Updated this week
DengKaiCQ / VGGT-Long
View on GitHub
Official implement of VGGT-Long
☆882Mar 20, 2026Updated 4 months ago