taco-group / NuScenes-SpatialQA
☆18Updated 9 months ago
Alternatives and similar repositories for NuScenes-SpatialQA
Users who are interested in NuScenes-SpatialQA are comparing it to the repositories listed below
- the official code of DriveMonkey☆42Updated 7 months ago
- ☆103Updated last year
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆36Updated 4 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆77Updated 4 months ago
- ☆30Updated last year
- Code repository of "GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation"☆57Updated last month
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆69Updated last month
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆30Updated 8 months ago
- SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving☆34Updated last week
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆50Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆116Updated last year
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆96Updated last year
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆51Updated 2 years ago
- Driving Everywhere with Large Language Model Policy Adaptation☆17Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆83Updated last year
- Official Github Repo for GEM☆101Updated 3 months ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆32Updated 2 years ago
- Out-of-Distribution Semantic Occupancy Prediction☆21Updated 3 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆50Updated last month
- ☆49Updated 2 months ago
- [ECCV 2024] Occupancy as Set of Points☆92Updated last year
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Updated 2 years ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆51Updated last month
- ☆22Updated 10 months ago
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object …☆14Updated last year
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆112Updated 11 months ago
- [ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.☆39Updated last year
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Updated 9 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆82Updated 8 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆128Updated 10 months ago