XiandaGuo / Drive-MLLMLinks
☆50Updated 2 months ago
Alternatives and similar repositories for Drive-MLLM
Users that are interested in Drive-MLLM are comparing it to the libraries listed below
Sorting:
- Benchmark and model for step-by-step reasoning in autonomous driving.☆65Updated 5 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆99Updated 7 months ago
- ☆65Updated last year
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆92Updated 8 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆146Updated last year
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆103Updated 3 weeks ago
- ☆75Updated last week
- [AAAI2025] Language Prompt for Autonomous Driving☆145Updated 8 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆73Updated 8 months ago
- [CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs☆31Updated last year
- the official code of DriveMonkey☆30Updated 3 months ago
- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving☆90Updated last year
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆64Updated 9 months ago
- ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving☆143Updated this week
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆46Updated 11 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆127Updated 5 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Updated 5 months ago
- ☆115Updated last year
- project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning"☆19Updated 6 months ago
- ☆82Updated 8 months ago
- This repo contains the code for paper "LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving"☆102Updated this week
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆111Updated 9 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆29Updated 3 months ago
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025)☆24Updated 2 months ago
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆132Updated 7 months ago
- 【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆51Updated last year
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆202Updated last month
- ECCV 2024 Paper List about Autonomous Driving☆127Updated 10 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆31Updated last year
- ☆85Updated 9 months ago