rese1f / Awesome-DriveLM
A collection of resources and papers on Large Language Models in autonomous driving
☆27 · Oct 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for Awesome-DriveLM
Users that are interested in Awesome-DriveLM are comparing it to the libraries listed below
- [ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation ☆24 · Aug 26, 2023 · Updated 2 years ago
- ICCV'23 | Adverse Weather Removal with Codebook Priors ☆10 · Aug 28, 2023 · Updated 2 years ago
- Git Repository for the ContextVLM paper and the DrivingContexts dataset ☆14 · Sep 13, 2024 · Updated last year
- Webpage ☆16 · Feb 16, 2024 · Updated last year
- Learning to Drive with GPT ☆297 · Feb 1, 2024 · Updated 2 years ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u… ☆25 · Jun 4, 2025 · Updated 8 months ago
- ☆229 · Dec 20, 2023 · Updated 2 years ago
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving ☆98 · Jan 1, 2024 · Updated 2 years ago
- ☆28 · Apr 8, 2025 · Updated 10 months ago
- For Ego4D VQ3D Task ☆22 · Jan 9, 2024 · Updated 2 years ago
- Exploring Large Language Models for Trajectory Prediction: A Technical Perspective ☆27 · Jun 12, 2024 · Updated last year
- RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark (ECCV 2024) ☆31 · Sep 10, 2024 · Updated last year
- A paper list of world model ☆29 · Apr 10, 2025 · Updated 10 months ago
- GroundCUA ☆67 · Dec 24, 2025 · Updated last month
- PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving" ☆570 · Sep 26, 2024 · Updated last year
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection ☆110 · May 5, 2023 · Updated 2 years ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt… ☆30 · Oct 21, 2024 · Updated last year
- Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving (ICCV 2025) ☆36 · May 29, 2025 · Updated 8 months ago
- BMVC'23 | FiveA+Network: You Only Need 9K Parameters for Underwater Image Enhancement ☆71 · Nov 5, 2023 · Updated 2 years ago
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving" ☆201 · Sep 26, 2024 · Updated last year
- [ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering ☆1,246 · Jul 2, 2025 · Updated 7 months ago
- On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent ☆302 · Mar 14, 2024 · Updated last year
- Official PyTorch implementation of `NeuralDiff: Segmenting 3D objects that move in egocentric videos` ☆32 · May 19, 2022 · Updated 3 years ago
- Official PyTorch Implementation of BB Generator & pRoI Generator [WACV2020] ☆30 · Mar 24, 2021 · Updated 4 years ago
- The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning ☆34 · Dec 16, 2024 · Updated last year
- A curated list of resources about long-context in large-language models and video understanding. ☆31 · Aug 8, 2023 · Updated 2 years ago
- [ICCV'23] Hidden Biases of End-to-End Driving Models & A starter kit for the CARLA leaderboard 2.0. ☆502 · Dec 27, 2025 · Updated last month
- [ECCV 2024] 3D World Model for Autonomous Driving ☆524 · Apr 12, 2024 · Updated last year
- This is a project on visual spatial reasoning tasks-SIBench ☆25 · Jan 12, 2026 · Updated last month
- Deep Learning Project for Trajectory Prediction using nuScenes dataset. ☆10 · Sep 13, 2022 · Updated 3 years ago
- A simple exam generator and grader written in Python with OpenCV ☆14 · Jan 14, 2026 · Updated last month
- Building a multi-agent RAG system with advanced RAG methods ☆12 · Jan 12, 2025 · Updated last year
- This repository shows how to calibrate a camera and lidar, including the camera intrinsics and the camera-lidar extrinsics ☆10 · Nov 28, 2021 · Updated 4 years ago
- Self-similarity Prior Distillation for Unsupervised Remote Physiological Measurement ☆10 · Oct 18, 2024 · Updated last year
- Sparse4D v1 & v2 ☆497 · Jun 25, 2024 · Updated last year
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment ☆41 · Dec 27, 2023 · Updated 2 years ago
- [ECCV 2020] Code release for "Resolution Switchable Networks for Runtime Efficient Image Recognition" ☆40 · Aug 11, 2020 · Updated 5 years ago
- Featurized Query R-CNN ☆45 · Jun 17, 2022 · Updated 3 years ago
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection ☆269 · Mar 15, 2023 · Updated 2 years ago