google-research-datasets / sanpo_dataset
☆41 Updated 5 months ago
Alternatives and similar repositories for sanpo_dataset:
Users interested in sanpo_dataset are comparing it to the repositories listed below:
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository). ☆75 Updated 3 weeks ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆86 Updated 2 months ago
- Codebase for the WayveScenes101 Dataset ☆174 Updated 6 months ago
- Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data" ☆80 Updated 3 months ago
- Unifying 2D and 3D Vision-Language Understanding ☆49 Updated 2 weeks ago
- Bridging lidar and text through image intermediaries ☆85 Updated last year
- [NeurIPS'24 Spotlight] Is Your LiDAR Placement Optimized for 3D Scene Understanding? ☆39 Updated 6 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆54 Updated last month
- ☆46 Updated 4 months ago
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023 ☆48 Updated 9 months ago
- Official Github Repo for GEM ☆32 Updated 3 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation ☆118 Updated last year
- Official PyTorch implementation of the UrbanGIRAFFE@ICCV2023 ☆58 Updated last year
- [NeurIPS 2024] DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features ☆28 Updated 3 months ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates… ☆38 Updated last year
- Toolkit for JRDB dataset ☆38 Updated 8 months ago
- Mask4Former: Mask Transformer for 4D Panoptic Segmentation ☆54 Updated 10 months ago
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23) ☆41 Updated last year
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆47 Updated last year
- MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning ☆62 Updated last year
- ☆90 Updated last year
- ☆53 Updated 11 months ago
- ☆51 Updated last year
- ☆77 Updated last year
- GEDepth: Ground Embedding for Monocular Depth Estimation (ICCV 2023) ☆57 Updated last year
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆94 Updated 4 months ago
- ☆30 Updated last year
- Official Implementation of DINO-Foresight: Looking into the Future with DINO ☆49 Updated last month
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆102 Updated last month
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated 8 months ago