facebookresearch / locate-3d
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
☆83Updated this week
Alternatives and similar repositories for locate-3d:
Users who are interested in locate-3d are comparing it to the libraries listed below.
- SceneFun3D ToolKit☆131Updated last week
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆143Updated last week
- Unifying 2D and 3D Vision-Language Understanding☆74Updated last week
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆67Updated last year
- Code for "Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling" (CoRL 2024)☆100Updated 4 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆87Updated 2 months ago
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)☆105Updated last year
- This is the official release for the paper "EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https://arx…☆142Updated last month
- [CVPR 2024] Official repository for "Tactile-Augmented Radiance Fields".☆58Updated 2 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆75Updated 8 months ago
- Agent-to-Sim: Learning Interactive Behavior from Casual Videos.☆42Updated 6 months ago
- [CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"☆78Updated last year
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)☆42Updated last year
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆97Updated 5 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆302Updated 9 months ago
- [ECCV 2024] PyTorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆99Updated last month
- Official PyTorch implementation of Doduo: Dense Visual Correspondence from Unsupervised Semantic-Aware Flow☆44Updated last year
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆147Updated last week
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆118Updated last year
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆92Updated 2 weeks ago
- PhyRecon: Physically Plausible Neural Scene Reconstruction☆149Updated last month
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆175Updated 2 weeks ago
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆68Updated last year
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry☆51Updated this week
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 5 months ago
- IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos☆41Updated 3 weeks ago
- Official implementation of the paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆120Updated 3 weeks ago
- Independent PyTorch Implementation of Object Scene Representation Transformer☆48Updated last year