Petrichor625 / Talk2car_CAVGLinks
[Communication in Transprotation Reasearch] Official PyTorch Implementation of ''GPT-4 enhanced multimodal grounding for autonomous driving: Leveraging cross-modal attention with large language models.''
☆25Updated last year
Alternatives and similar repositories for Talk2car_CAVG
Users that are interested in Talk2car_CAVG are comparing it to the libraries listed below
Sorting:
- ☆180Updated last year
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving☆98Updated last year
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆115Updated last year
- ☆100Updated 11 months ago
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆98Updated last year
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆35Updated 2 years ago
- [AAAI2025] Language Prompt for Autonomous Driving☆152Updated 2 months ago
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.☆217Updated last year
- [ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving☆107Updated 6 months ago
- ☆90Updated last year
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆140Updated 11 months ago
- Learning to Drive with GPT☆291Updated last year
- [CVPR 2024] On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving☆148Updated last year
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆115Updated last year
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆208Updated 5 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆163Updated 2 years ago
- ☆93Updated last year
- A Language Agent for Autonomous Driving☆288Updated last week
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"☆197Updated last year
- ☆45Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆67Updated 9 months ago
- Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning☆301Updated 8 months ago
- This repo contains the code for paper "LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving"☆129Updated last month
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes'☆26Updated last year
- [ICLR 2024] DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models☆295Updated last year
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models☆19Updated 8 months ago
- Drive-Pi0 and DriveMoE on End-to-end Autonomous Driving☆128Updated last week
- FlowDrive: Energy Flow Field for End-to-End Autonomous Driving☆41Updated 3 months ago
- ☆70Updated last year
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆74Updated 2 months ago