taco-group / NuScenes-SpatialQA
☆18Updated 7 months ago
Alternatives and similar repositories for NuScenes-SpatialQA
Users who are interested in NuScenes-SpatialQA are comparing it to the repositories listed below.
- The official code of DriveMonkey☆38Updated 5 months ago
- ☆102Updated 11 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆73Updated last month
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆60Updated 8 months ago
- Driving Everywhere with Large Language Model Policy Adaptation☆17Updated last year
- Adding Scene-Centric Forecasting Control to Occupancy World Model☆31Updated 2 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆49Updated 2 years ago
- ☆29Updated last year
- Official code for the ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Updated last year
- [IEEE RA-L 2025] Generate Weather with LLM. Code for "WeatherDG: LLM-assisted Procedural Weather Generation for Domain-Generalized Semant…☆47Updated 5 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆29Updated 6 months ago
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object …☆14Updated last year
- ☆49Updated 2 years ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆31Updated last year
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆84Updated 11 months ago
- [ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.☆38Updated last year
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆75Updated last year
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Updated 7 months ago
- [ICCV 2025] Official implementation of "AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving"☆28Updated 4 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆105Updated 9 months ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆47Updated 2 years ago
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar…☆61Updated 8 months ago
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)☆80Updated 6 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆46Updated 4 months ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆47Updated last month
- ☆82Updated 2 years ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆49Updated last year
- [ECCV 2024] Occupancy as Set of Points☆90Updated last year
- Official Pytorch Implementation for "DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving Data Generation" (TOMM)☆24Updated 8 months ago
- Official Github Repo for GEM☆95Updated last month