SaFo-Lab / DolphinsLinks
[ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“
☆84Updated 10 months ago
Alternatives and similar repositories for Dolphins
Users that are interested in Dolphins are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving☆96Updated last year
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"☆193Updated last year
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆97Updated last year
- [AAAI2025] Language Prompt for Autonomous Driving☆150Updated 2 months ago
- [ECCV 2024] Embodied Understanding of Driving Scenarios☆206Updated 5 months ago
- ☆70Updated last year
- [AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.☆216Updated last year
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆115Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆67Updated 8 months ago
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding☆163Updated 2 years ago
- ☆180Updated last year
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆79Updated last year
- On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent☆301Updated last year
- Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)☆115Updated last year
- ☆90Updated last year
- A Language Agent for Autonomous Driving☆287Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆109Updated 10 months ago
- Awesome Papers about World Models in Autonomous Driving☆86Updated last year
- [CVPR 2024] LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs☆32Updated last year
- [ECCV 2024] Asynchronous Large Language Model Enhanced Planner for Autonomous Driving☆107Updated 6 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆35Updated last year
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025)☆33Updated 6 months ago
- ☆96Updated 11 months ago
- [CVPR 2024] A world model for autonomous driving.☆395Updated 2 years ago
- [NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving☆140Updated 10 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆74Updated 2 months ago
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆77Updated last year
- ☆92Updated last year
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆32Updated 2 years ago
- ☆124Updated last year