A collection of resources and papers on Large Language Models in autonomous driving
☆27 · Oct 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for Awesome-DriveLM
Users who are interested in Awesome-DriveLM are comparing it to the repositories listed below
- ICCV'23 | Adverse Weather Removal with Codebook Priors ☆10 · Aug 28, 2023 · Updated 2 years ago
- Webpage ☆16 · Feb 16, 2024 · Updated 2 years ago
- Learning to Drive with GPT ☆298 · Feb 1, 2024 · Updated 2 years ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u… ☆25 · Jun 4, 2025 · Updated 9 months ago
- ☆229 · Dec 20, 2023 · Updated 2 years ago
- ☆23 · Nov 9, 2023 · Updated 2 years ago
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving ☆99 · Jan 1, 2024 · Updated 2 years ago
- ☆28 · Apr 8, 2025 · Updated 11 months ago
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes' ☆27 · Oct 10, 2024 · Updated last year
- For Ego4D VQ3D Task ☆22 · Jan 9, 2024 · Updated 2 years ago
- RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark (ECCV 2024) ☆31 · Sep 10, 2024 · Updated last year
- GroundCUA ☆68 · Dec 24, 2025 · Updated 2 months ago
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection ☆110 · May 5, 2023 · Updated 2 years ago
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025) ☆36 · May 29, 2025 · Updated 9 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt… ☆30 · Oct 21, 2024 · Updated last year
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective ☆28 · Apr 4, 2024 · Updated last year
- A Massive Multi-Discipline Lecture Understanding Benchmark ☆33 · Nov 1, 2025 · Updated 4 months ago
- BMVC'23 | FiveA+Network: You Only Need 9K Parameters for Underwater Image Enhancement ☆71 · Nov 5, 2023 · Updated 2 years ago
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving" ☆201 · Sep 26, 2024 · Updated last year
- [ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering ☆1,256 · Jul 2, 2025 · Updated 8 months ago
- On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent ☆303 · Mar 14, 2024 · Updated last year
- The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning ☆34 · Dec 16, 2024 · Updated last year
- Official PyTorch Implementation of BB Generator & pRoI Generator [WACV2020] ☆30 · Mar 24, 2021 · Updated 4 years ago
- Official PyTorch implementation of `NeuralDiff: Segmenting 3D objects that move in egocentric videos` ☆32 · May 19, 2022 · Updated 3 years ago
- A curated list of resources about long-context in large-language models and video understanding. ☆32 · Aug 8, 2023 · Updated 2 years ago
- ☆29 · Aug 6, 2025 · Updated 7 months ago
- [ICCV'23] Hidden Biases of End-to-End Driving Models & A starter kit for the CARLA leaderboard 2.0. ☆509 · Dec 27, 2025 · Updated 2 months ago
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception ☆57 · Feb 4, 2025 · Updated last year
- [ECCV 2024] 3D World Model for Autonomous Driving ☆525 · Apr 12, 2024 · Updated last year
- LisanBench is a lightweight benchmark for LLMs that stresses forward planning, vocabulary depth, constraint adherence, attention, and lon… ☆23 · Jun 1, 2025 · Updated 9 months ago
- ☆16 · Jan 23, 2026 · Updated last month
- This is a project on visual spatial reasoning tasks-SIBench ☆25 · Jan 12, 2026 · Updated last month
- Building a multi-agent RAG system with advanced RAG methods ☆12 · Jan 12, 2025 · Updated last year
- This repository shows how to calibrate a camera and lidar, including the camera intrinsics and the camera-lidar extrinsics ☆10 · Nov 28, 2021 · Updated 4 years ago
- Sparse4D v1 & v2 ☆497 · Jun 25, 2024 · Updated last year
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment ☆41 · Dec 27, 2023 · Updated 2 years ago
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection ☆270 · Mar 15, 2023 · Updated 2 years ago
- A curated list of awesome LLM/VLM/VLA/World Model for Autonomous Driving(LLM4AD) resources (continually updated) ☆1,695 · Mar 1, 2026 · Updated last week
- This is the official implementation of "Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation" ☆41 · Dec 1, 2024 · Updated last year