Robbyant / lingbot-depthLinks
Masked Depth Modeling for Spatial Perception
☆797Updated last week
Alternatives and similar repositories for lingbot-depth
Users that are interested in lingbot-depth are comparing it to the libraries listed below
Sorting:
- Towards a Generative 3D World Engine for Embodied Intelligence☆387Updated last week
- PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation☆357Updated last month
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry☆369Updated last month
- Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching☆468Updated last month
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆821Updated 3 months ago
- Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in…☆443Updated this week
- ☆470Updated 5 months ago
- [CVPR 2025] Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera☆294Updated 4 months ago
- [CVPR 25] Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation☆253Updated 4 months ago
- Causal video-action world model for generalist robot control☆541Updated this week
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset☆412Updated 8 months ago
- Sim-to-real and CDM inference code for ManipAsInSim project.☆140Updated 2 months ago
- A Pragmatic VLA Foundation Model☆683Updated last week
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆167Updated 3 weeks ago
- OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer☆268Updated 3 weeks ago
- [ICCV 2025] PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos☆364Updated 2 weeks ago
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields☆489Updated 3 months ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆761Updated last week
- ☆227Updated 4 months ago
- A diffusion model-based stereo depth estimation framework that can predict and restore noisy depth maps for transparent and specular surf…☆87Updated 11 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆162Updated 8 months ago
- ☆183Updated 6 months ago
- ☆416Updated 2 weeks ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆397Updated last month
- [ICCV 2025] Detect Anything 3D in the Wild☆246Updated last month
- [CoRL 2025] Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware☆324Updated 2 months ago
- GraspSplats: Efficient Manipulation with 3D Feature Splatting☆145Updated last year
- [CVPR 2025] Any6D: Model-free 6D Pose Estimation of Novel Objects☆386Updated 5 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆172Updated 7 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆334Updated 5 months ago