google-research-datasets / sanpo_dataset
☆48 · Updated 5 months ago
Alternatives and similar repositories for sanpo_dataset
Users who are interested in sanpo_dataset are comparing it to the repositories listed below.
- [NeurIPS 2025] Official Implementation of DINO-Foresight: Looking into the Future with DINO ☆134 · Updated 2 weeks ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆116 · Updated 8 months ago
- Toolkit for JRDB dataset ☆43 · Updated last year
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight) ☆74 · Updated 2 months ago
- Codebase for the WayveScenes101 Dataset ☆189 · Updated last year
- VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository). ☆120 · Updated 5 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆97 · Updated 10 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation ☆122 · Updated last year
- Unifying 2D and 3D Vision-Language Understanding ☆116 · Updated 4 months ago
- ☆53 · Updated last year
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates… ☆39 · Updated 2 years ago
- Official PyTorch implementation of the UrbanGIRAFFE@ICCV2023 ☆59 · Updated 2 years ago
- Official implementation of DepthLM ☆273 · Updated 2 months ago
- ☆102 · Updated 8 months ago
- ☆49 · Updated 2 years ago
- ☆41 · Updated last year
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆67 · Updated 2 years ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆60 · Updated 9 months ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆122 · Updated 6 months ago
- [NeurIPS 2024] DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features ☆35 · Updated last year
- [ICCV 2025] Detect Anything 3D in the Wild ☆235 · Updated 5 months ago
- ☆33 · Updated last year
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025] ☆55 · Updated 9 months ago
- ☆93 · Updated 11 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆112 · Updated 10 months ago
- ☆95 · Updated last year
- [3DV 2026] Open Vocabulary Monocular 3D Object Detection ☆66 · Updated 2 weeks ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model ☆109 · Updated 6 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆42 · Updated last year
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution ☆206 · Updated 2 weeks ago