talk2car / Talk2Car
The official Talk2Car dataset repo
☆78Updated 2 weeks ago
Alternatives and similar repositories for Talk2Car:
Users that are interested in Talk2Car are comparing it to the libraries listed below
- Berkeley Deep Drive-X (eXplanation) dataset☆114Updated 6 years ago
- ☆80Updated 3 years ago
- [ECCV 2022] Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining☆82Updated 2 years ago
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.☆179Updated 5 months ago
- Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving☆79Updated last year
- [AAAI2025] Language Prompt for Autonomous Driving☆131Updated 3 months ago
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"☆160Updated 6 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆67Updated last month
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆88Updated 5 months ago
- ☆57Updated 7 months ago
- ☆173Updated last year
- [CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs☆29Updated 11 months ago
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆86Updated 3 months ago
- [WACV 2024 LLVM-AD Challenge] UCU Dataset☆15Updated last year
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆184Updated 3 months ago
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆109Updated 4 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆130Updated last year
- [CVPR2024 Highlight] The official repo for paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception"☆48Updated last week
- Benchmark and model for step-by-step reasoning in autonomous driving.☆38Updated 2 weeks ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆25Updated last year
- ☆12Updated 3 years ago
- Explaining Autonomous Driving Actions with Visual Question Answering (IEEE ITSC-2023)☆17Updated last year
- [CoRL 2023] The official code for paper "Language Conditioned Traffic Generation"☆77Updated 9 months ago
- Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆59Updated last month
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆28Updated 11 months ago
- [ICCV 2023 Oral] A New Paradigm for End-to-end Autonomous Driving to Alleviate Causal Confusion☆221Updated last year
- ☆31Updated last year
- ☆36Updated last month
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆90Updated last year
- ☆31Updated last year