OpenDriveLab/DetAny3D

[ICCV 2025] Detect Anything 3D in the Wild

☆284

Alternatives and similar repositories for DetAny3D

Users who are interested in DetAny3D are comparing it to the repositories listed below.

UVA-Computer-Vision-Lab / ovmono3d
[3DV 2026] Open Vocabulary Monocular 3D Object Detection
☆98 · Apr 29, 2026 · Updated 2 months ago
jjxjiaxue / DetAny3D
A demo page for DetAny3D
☆14 · May 29, 2025 · Updated last year
Lizhuoling / UniMODE
☆52 · May 6, 2025 · Updated last year
UVA-Computer-Vision-Lab / 3d_annotator
3D BBox refinement interface used in LabelAny3D (NeurIPS 2025)
☆22 · Jan 6, 2026 · Updated 6 months ago
cvg / 3D-MOOD
[ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
☆123 · Oct 14, 2025 · Updated 9 months ago
UVA-Computer-Vision-Lab / LabelAny3D
[NeurIPS 2025] LabelAny3D: Label Any Object 3D in the Wild
☆130 · Jan 6, 2026 · Updated 6 months ago
yyfz / Pi3
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,072 · Jul 3, 2026 · Updated 2 weeks ago
henry123-boy / SpaTrackerV2
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984 · Feb 27, 2026 · Updated 4 months ago
NVlabs / L4P
(3DV 2026 Oral) L4P -- a feed-forward foundational model designed for multiple low-level 4D vision perception tasks.
☆72 · Dec 9, 2025 · Updated 7 months ago
OpenDriveLab / MTGS
MTGS: Multi-Traversal Gaussian Splatting
☆160 · Jan 29, 2026 · Updated 5 months ago
microsoft / MoGe
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
☆2,648 · Nov 2, 2025 · Updated 8 months ago
PuFanqi23 / MonoDGP
[CVPR 2025] The official implementation of 'MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors'
☆97 · Aug 13, 2025 · Updated 11 months ago
wzzheng / StreamVGGT
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆941 · Oct 27, 2025 · Updated 8 months ago
JihyeokKim / MonoDINO-DETR
MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model
☆46 · May 27, 2025 · Updated last year
OpenDriveLab / Nexus
[ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation
☆122 · Jan 6, 2026 · Updated 6 months ago
zju3dv / BoxDreamer
Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.
☆108 · Oct 6, 2025 · Updated 9 months ago
CUT3R / CUT3R
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,464 · Aug 27, 2025 · Updated 10 months ago
CN-ADLab / SAM4D
[ICCV 2025] SAM4D: Segment Anything in Camera and LiDAR Streams
☆234 · Sep 23, 2025 · Updated 9 months ago
LaVi-Lab / VG-LLM
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆245 · Nov 28, 2025 · Updated 7 months ago
GZWSAMA / OnePoseviaGen
[CORL 2025 Oral] One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.
☆483 · Aug 14, 2025 · Updated 11 months ago
VITA-Group / VLM-3R
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆428 · Updated this week
ByteDance-Seed / Depth-Anything-3
Depth Anything 3
☆5,917 · Updated this week
facebookresearch / DepthLM_Official
[ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM
☆362 · Jun 1, 2026 · Updated last month
NVlabs / LocateAnything3D
☆68 · Apr 8, 2026 · Updated 3 months ago
Mingqj / OcRFDet
[ICCV 2025] OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving
☆15 · Jun 17, 2026 · Updated last month
facebookresearch / map-anything
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,569 · Updated this week
SpatialVision / Orient-Anything
Orient Anything, ICML 2025
☆389 · Feb 6, 2026 · Updated 5 months ago
facebookresearch / vggt
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆13,918 · May 19, 2026 · Updated 2 months ago
OpenDriveLab / ReSim
[NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving
☆183 · May 5, 2026 · Updated 2 months ago
facebookresearch / omni3d
Code release for "Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild"
☆856 · Apr 7, 2024 · Updated 2 years ago
ethz-vlg / mvtracker
[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking
☆511 · Nov 3, 2025 · Updated 8 months ago
liruilong940607 / prope
Cameras as Relative Positional Encoding
☆739 · Dec 18, 2025 · Updated 7 months ago
wzzheng / DVGT
[CVPR 2026] Visual Geometry Transformer for Autonomous Driving
☆327 · Jun 10, 2026 · Updated last month
Any-4D / Any4D
Any4D: Unified Feed-Forward Metric 4D Reconstruction
☆382 · Apr 17, 2026 · Updated 3 months ago
zbw001 / TAPIP3D
TAPIP3D: Tracking Any Point in Persistent 3D Geometry
☆411 · Dec 28, 2025 · Updated 6 months ago
facebookresearch / fast3r
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
☆1,598 · May 7, 2025 · Updated last year
Robbyant / lingbot-depth
Masked Depth Modeling for Spatial Perception
☆1,505 · Jul 8, 2026 · Updated last week
Tsinghua-MARS-Lab / SLAM-Former
[ECCV 2026] SLAM-Former: Putting SLAM into One Transformer
☆480 · Jun 18, 2026 · Updated last month
HaoyiZhu / SPA
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆177 · Jun 19, 2025 · Updated last year