XiandaGuo / Drive-MLLM
[NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models
☆73 Updated last month
Alternatives and similar repositories for Drive-MLLM
Users who are interested in Drive-MLLM are comparing it to the libraries listed below
- Benchmark and model for step-by-step reasoning in autonomous driving. ☆66 Updated 8 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆105 Updated 9 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆79 Updated 11 months ago
- ☆69 Updated last year
- Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595) ☆95 Updated 11 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding ☆159 Updated last year
- the official code of DriveMonkey ☆38 Updated 5 months ago
- project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning" ☆19 Updated last month
- ☆74 Updated 3 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆69 Updated last year
- ☆93 Updated 10 months ago
- [NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning ☆124 Updated last week
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving ☆94 Updated last year
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆129 Updated 8 months ago
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025) ☆30 Updated 5 months ago
- [CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs ☆32 Updated last year
- [ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving ☆105 Updated 5 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving" ☆29 Updated 6 months ago
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving ☆137 Updated 10 months ago
- [AAAI2025] Language Prompt for Autonomous Driving ☆150 Updated last month
- This repo contains the code for paper "LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving" ☆122 Updated 2 months ago
- ☆88 Updated last year
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception ☆50 Updated 9 months ago
- [ECCV 2024] Embodied Understanding of Driving Scenarios ☆205 Updated 4 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression ☆49 Updated last year
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase ☆75 Updated last year
- ☆122 Updated last year
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction ☆93 Updated last week
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving" ☆84 Updated 9 months ago
- Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning ☆295 Updated 7 months ago