XiandaGuo / Drive-MLLM
[NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models
☆78 Updated 4 months ago
Alternatives and similar repositories for Drive-MLLM
Users who are interested in Drive-MLLM compare it to the repositories listed below.
- Benchmark and model for step-by-step reasoning in autonomous driving. ☆68 Updated 10 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆116 Updated last year
- ☆70 Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆83 Updated last year
- Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595) ☆99 Updated last year
- The official code of DriveMonkey ☆43 Updated 8 months ago
- DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning ☆76 Updated last month
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆128 Updated 10 months ago
- [CoRL 2024] Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆72 Updated last year
- ☆73 Updated 5 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding ☆164 Updated 2 years ago
- [NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning ☆178 Updated 2 months ago
- ☆101 Updated last year
- Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning” ☆115 Updated last month
- Project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning" ☆20 Updated 4 months ago
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving ☆98 Updated 2 years ago
- [AAAI 2025] Language Prompt for Autonomous Driving ☆154 Updated 4 months ago
- Code for CVPR 2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction ☆100 Updated 2 months ago
- [ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving ☆110 Updated 8 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving" ☆30 Updated 8 months ago
- This is the official project repository for "DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diff… ☆37 Updated 5 months ago
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase ☆80 Updated last year
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025) ☆37 Updated 8 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception" ☆33 Updated 10 months ago
- ☆127 Updated last year
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25) ☆95 Updated 11 months ago
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving ☆141 Updated last year
- ☆71 Updated 2 months ago
- Official Code Release of Delphi ☆56 Updated last year
- Official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model* ☆61 Updated last year