PJLab-ADG / GPT4V-AD-Exploration
On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent
☆290Updated 10 months ago
Alternatives and similar repositories for GPT4V-AD-Exploration:
Users that are interested in GPT4V-AD-Exploration are comparing it to the libraries listed below
- Learning to Drive with GPT☆250Updated 11 months ago
- ☆274Updated 5 months ago
- ☆168Updated last year
- This repository collects research papers of large Vision Language Models in Autonomous driving and Intelligent Transportation System. Th…☆188Updated 4 months ago
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆167Updated 2 weeks ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆53Updated 6 months ago
- Drive Like a Human: Rethinking Autonomous Driving with Large Language Models☆370Updated 5 months ago
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.☆168Updated 2 months ago
- [WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving☆249Updated 10 months ago
- A Language Agent for Autonomous Driving☆247Updated 9 months ago
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"☆153Updated 3 months ago
- [CVPR 2024] A world model for autonomous driving.☆329Updated last year
- [AAAI2025] Language Prompt for Autonomous Driving☆127Updated last month
- A curated list of awesome knowledge-driven autonomous driving (continually updated)☆424Updated 7 months ago
- [ICCV 2023 Oral] A New Paradigm for End-to-end Autonomous Driving to Alleviate Causal Confusion☆207Updated last year
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆119Updated last year
- [ICLR 2024] DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models☆240Updated 10 months ago
- A curated list of world models for autonomous driving. Keep updated.☆224Updated last week
- Bridging Large Vision-Language Models and End-to-End Autonomous Driving☆270Updated 3 weeks ago
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆106Updated 2 months ago
- [CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System☆655Updated 2 months ago
- PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"☆475Updated 3 months ago
- ☆60Updated 2 months ago
- Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving☆396Updated this week
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆82Updated 3 months ago
- [ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering☆929Updated last week
- [ECCV 2024] GenAD: Generative End-to-End Autonomous Driving☆340Updated last week
- 3D Occupancy Prediction Benchmark in Autonomous Driving☆326Updated 7 months ago
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆80Updated last month
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆100Updated last week