google-research-datasets / sanpo_dataset
☆39Updated 3 weeks ago
Related projects: ⓘ
- Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data"☆51Updated last month
- ☆29Updated last year
- Toolkit for JRDB dataset☆31Updated last month
- Source code for "To Adapt or Not to Adapt? Real-Time Adaptation for Semantic Segmentation", ICCV 2023☆47Updated 3 months ago
- Mask4Former: Mask Transformer for 4D Panoptic Segmentation☆37Updated 4 months ago
- ☆56Updated 10 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆240Updated 2 months ago
- ☆82Updated 5 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆102Updated 8 months ago
- [ICLR 2024] This is the official code of the paper "V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection"☆83Updated 5 months ago
- Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica dataset…☆62Updated 3 weeks ago
- Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆42Updated 5 months ago
- [CVPR2024] Open-world Semantic Segmentation Including Class Similarity☆62Updated last month
- [ICRA2024] Few-Shot Panoptic Segmentation With Foundation Models☆27Updated last month
- Codebase for the WayveScenes101 Dataset☆154Updated last month
- [CoRL2023] Open-Vocabulary Scene-Graph☆47Updated 8 months ago
- ☆51Updated 10 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆72Updated last month
- Official PyTorch implementation of the UrbanGIRAFFE@ICCV2023☆56Updated 11 months ago
- Bridging lidar and text through image intermediaries☆75Updated 7 months ago
- WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆75Updated 9 months ago
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)☆64Updated last month
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆88Updated 4 months ago
- MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆60Updated 6 months ago
- A project for computing high-quality ground truth training examples for RGB-D data.☆43Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆49Updated 3 months ago
- [CVPR 2024 Highlight] OpenESS: Event-Based Semantic Scene Understanding with Open Vocabularies☆39Updated 4 months ago
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)☆35Updated 11 months ago
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆78Updated 2 months ago
- A modular library for visual 4D scene understanding☆17Updated 2 weeks ago