google-research-datasets / sanpo_dataset
☆41 Updated 5 months ago
Alternatives and similar repositories for sanpo_dataset:
Users who are interested in sanpo_dataset are comparing it to the repositories listed below.
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository). ☆81 Updated last month
- ☆30 Updated 2 years ago
- Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data" ☆81 Updated 4 months ago
- Codebase for the WayveScenes101 Dataset ☆176 Updated 7 months ago
- Toolkit for JRDB dataset ☆38 Updated 8 months ago
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023 ☆48 Updated 10 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes ☆58 Updated 7 months ago
- [NeurIPS 2024] DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features ☆29 Updated 4 months ago
- Official PyTorch implementation of the UrbanGIRAFFE@ICCV2023 ☆58 Updated last year
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆87 Updated 2 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆54 Updated last month
- Official Github Repo for GEM ☆44 Updated last week
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates… ☆38 Updated last year
- ☆18 Updated last month
- ☆79 Updated 3 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆97 Updated 5 months ago
- ☆55 Updated 5 months ago
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images" ☆106 Updated 3 months ago
- Unifying 2D and 3D Vision-Language Understanding ☆74 Updated 2 weeks ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆116 Updated last month
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset ☆83 Updated this week
- Bridging lidar and text through image intermediaries ☆86 Updated last year
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23) ☆42 Updated last year
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models ☆303 Updated 9 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆103 Updated 2 months ago
- ☆53 Updated last year
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023) ☆25 Updated last year
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion" ☆67 Updated last year
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation ☆118 Updated last year
- ☆97 Updated last year