rese1f/Awesome-DriveLM

📚 A collection of resources and papers on Large Language Models in autonomous driving

☆27

Alternatives and similar repositories for Awesome-DriveLM

Users who are interested in Awesome-DriveLM are comparing it to the repositories listed below.

Owen718 / AWRCP
ICCV'23 | Adverse Weather Removal with Codebook Priors
☆10 · Aug 28, 2023 · Updated 2 years ago
concept-fusion / concept-fusion.github.io
Webpage
☆16 · Feb 16, 2024 · Updated 2 years ago
PointsCoder / GPT-Driver
Learning to Drive with GPT
☆298 · Feb 1, 2024 · Updated 2 years ago
Andy-Cheng / TEMPURA
TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…
☆25 · Jun 4, 2025 · Updated 9 months ago
E2E-AD / AD-MLP
☆229 · Dec 20, 2023 · Updated 2 years ago
jiachenlei / maskdm
☆23 · Nov 9, 2023 · Updated 2 years ago
fudan-zvg / Reason2Drive
[ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
☆99 · Jan 1, 2024 · Updated 2 years ago
SHI-Labs / Slow-Fast-Video-Multimodal-LLM
☆28 · Apr 8, 2025 · Updated 11 months ago
ZionGo6 / VLM-Auto
Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes'
☆27 · Oct 10, 2024 · Updated last year
Wayne-Mai / EgoLoc
For Ego4D VQ3D Task
☆22 · Jan 9, 2024 · Updated 2 years ago
ipl-uw / RT-POSE
RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark (ECCV 2024)
☆31 · Sep 10, 2024 · Updated last year
ServiceNow / GroundCUA
GroundCUA
☆68 · Dec 24, 2025 · Updated 2 months ago
kaixinbear / CAPE
(CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
☆110 · May 5, 2023 · Updated 2 years ago
Depth2World / VLADBench
Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025)
☆36 · May 29, 2025 · Updated 9 months ago
Owen718 / LongPrompt-LLamaGen
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30 · Oct 21, 2024 · Updated last year
reachpranjal / lego-drive
[Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective
☆28 · Apr 4, 2024 · Updated last year
Espere-1119-Song / Video-MMLU
A Massive Multi-Discipline Lecture Understanding Benchmark
☆33 · Nov 1, 2025 · Updated 4 months ago
Owen718 / FiveAPlus-Network
BMVC'23 | FiveA+Network: You Only Need 9K Parameters for Underwater Image Enhancement
☆71 · Nov 5, 2023 · Updated 2 years ago
wayveai / LingoQA
[ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"
☆201 · Sep 26, 2024 · Updated last year
OpenDriveLab / DriveLM
[ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering
☆1,256 · Jul 2, 2025 · Updated 8 months ago
PJLab-ADG / GPT4V-AD-Exploration
On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent
☆303 · Mar 14, 2024 · Updated last year
XiaoMi / CGNet
The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning
☆34 · Dec 16, 2024 · Updated last year
kemaloksuz / BoundingBoxGenerator
Official PyTorch Implementation of BB Generator & pRoI Generator [WACV2020]
☆30 · Mar 24, 2021 · Updated 4 years ago
dichotomies / NeuralDiff
Official PyTorch implementation of `NeuralDiff: Segmenting 3D objects that move in egocentric videos`
☆32 · May 19, 2022 · Updated 3 years ago
showlab / Awesome-Long-Context
A curated list of resources about long-context in large-language models and video understanding.
☆32 · Aug 8, 2023 · Updated 2 years ago
GaavaMa / Causal-Diffusion-Policy
☆29 · Aug 6, 2025 · Updated 7 months ago
autonomousvision / carla_garage
[ICCV'23] Hidden Biases of End-to-End Driving Models & A starter kit for the CARLA leaderboard 2.0.
☆509 · Dec 27, 2025 · Updated 2 months ago
taco-group / STAMP
[ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception
☆57 · Feb 4, 2025 · Updated last year
wzzheng / OccWorld
[ECCV 2024] 3D World Model for Autonomous Driving
☆525 · Apr 12, 2024 · Updated last year
voice-from-the-outer-world / lisan-bench
LisanBench is a lightweight benchmark for LLMs that stresses forward planning, vocabulary depth, constraint adherence, attention, and lon…
☆23 · Jun 1, 2025 · Updated 9 months ago
MachineConsciousness / Machine-Consciousness-HKUSTGZ-AIAA5050
☆16 · Jan 23, 2026 · Updated last month
song2yu / SIBench-VSR
This is a project on visual spatial reasoning tasks: SIBench
☆25 · Jan 12, 2026 · Updated last month
yarikama / Agentic-Advanced-RAG
Building a multi-agent RAG system with advanced RAG methods
☆12 · Jan 12, 2025 · Updated last year
Dysonsun / lidar_camera_calibration
This repository shows how to calibrate a camera and lidar, including the camera intrinsics and the camera-lidar extrinsics
☆10 · Nov 28, 2021 · Updated 4 years ago
linxuewu / Sparse4D
Sparse4D v1 & v2
☆497 · Jun 25, 2024 · Updated last year
rese1f / STEVE
[ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment
☆41 · Dec 27, 2023 · Updated 2 years ago
Divadi / SOLOFusion
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
☆270 · Mar 15, 2023 · Updated 2 years ago
Thinklab-SJTU / Awesome-LLM4AD
A curated list of awesome LLM/VLM/VLA/World Model for Autonomous Driving (LLM4AD) resources (continually updated)
☆1,695 · Mar 1, 2026 · Updated last week
ipl-uw / ZeDO-Release
This is the official implementation of "Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation"
☆41 · Dec 1, 2024 · Updated last year