taco-group / NuScenes-SpatialQA
☆18 Updated 6 months ago
Alternatives and similar repositories for NuScenes-SpatialQA
Users who are interested in NuScenes-SpatialQA are comparing it to the repositories listed below
- The official code of DriveMonkey ☆37 Updated 5 months ago
- ☆102 Updated 11 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆59 Updated 7 months ago
- Adding Scene-Centric Forecasting Control to Occupancy World Model ☆30 Updated 2 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models ☆70 Updated last month
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo… ☆15 Updated last year
- [ICCV 2023] GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding ☆49 Updated 2 years ago
- Driving Everywhere with Large Language Model Policy Adaptation ☆17 Updated last year
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object … ☆14 Updated last year
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆47 Updated 2 years ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving" ☆29 Updated 5 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners ☆47 Updated 4 months ago
- Out-of-Distribution Semantic Occupancy Prediction ☆19 Updated last week
- [ICCV 2025] Official implementation of "AD-GS: Object-Aware B-Spline Gaussian Splatting for Self-Supervised Autonomous Driving" ☆28 Updated 3 months ago
- [ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene. ☆38 Updated last year
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" ☆14 Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression ☆48 Updated last year
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models ☆30 Updated last year
- ☆50 Updated last year
- ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024) ☆80 Updated 5 months ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆83 Updated 10 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆101 Updated 9 months ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception ☆31 Updated last year
- [ECCV 2024] Occupancy as Set of Points ☆90 Updated last year
- ☆28 Updated last year
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception" ☆33 Updated 7 months ago
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2… ☆28 Updated 6 months ago
- ☆20 Updated 9 months ago
- [NeurIPS 2022] 4D Unsupervised Object Discovery ☆56 Updated last year
- ☆82 Updated 2 years ago