XiandaGuo / Drive-MLLMLinks

[NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models

☆74

Alternatives and similar repositories for Drive-MLLM

Users that are interested in Drive-MLLM are comparing it to the libraries listed below

Sorting:

wzzheng / Doe
Doe-1: Closed-Loop Autonomous Driving with Large World Model
☆109Updated 10 months ago
ayesha-ishaq / DriveLMM-o1
Benchmark and model for step-by-step reasoning in autonomous driving.
☆67Updated 8 months ago
Robertwyq / Drivingdojo
[NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
☆79Updated last year
zc-zhao / DriveMonkey
the official code of DriveMonkey
☆39Updated 6 months ago
HGao-cv / RAD
project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning"
☆19Updated 2 months ago
xmed-lab / NuInstruct
☆70Updated last year
zhijian11 / RoboTron-Drive
☆96Updated 11 months ago
Robot-K / Hint-AD
CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving
☆69Updated last year
LLVM-AD / MAPLM
[CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding
☆163Updated 2 years ago
EMZucas / minidrive
☆73Updated 3 months ago
DLUT-LYZ / CODA-LM
Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)
☆97Updated last year
hustvl / RAD
[NeurIPS 2025] RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
☆134Updated last month
memberRE / AsyncDriver
[ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving
☆107Updated 6 months ago
jxbbb / TOD3Cap
[ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
☆129Updated 9 months ago
zympsyche / BevWorld
☆124Updated last year
fudan-zvg / Reason2Drive
[ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
☆96Updated last year
YanhaoWu / UMGen
Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction
☆96Updated last month
Depth2World / VLADBench
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025)
☆33Updated 6 months ago
wudongming97 / Prompt4Driving
[AAAI2025] Language Prompt for Autonomous Driving
☆150Updated 2 months ago
4DVLab / IDKB
Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"
☆29Updated 6 months ago
xiaomi-research / recogdrive
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
☆363Updated this week
akshaygopalkr / EM-VLM4AD
☆90Updated last year
alexanderswerdlow / BEVGen
Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase
☆77Updated last year
PurdueDigitalTwin / LaMPilot
[CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs
☆32Updated last year
michigan-traffic-lab / LightEMMA
This repo contains the code for paper "LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving"
☆127Updated 2 weeks ago
BraveGroup / LAW
(ICLR2025) Enhancing End-to-End Autonomous Driving with Latent World Model
☆282Updated 5 months ago
OpenDriveLab / ELM
[ECCV 2024] Embodied Understanding of Driving Scenarios
☆206Updated 5 months ago
PJLab-ADG / LeapAD
[NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
☆140Updated 10 months ago
hustvl / AlphaDrive
Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
☆300Updated 8 months ago
jbji / RepVF
[ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"
☆33Updated 8 months ago