worldbench/awesome-vla-for-ad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/worldbench/awesome-vla-for-ad)

worldbench / awesome-vla-for-ad

🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future

☆317

Alternatives and similar repositories for awesome-vla-for-ad

Users that are interested in awesome-vla-for-ad are comparing it to the libraries listed below

Sorting:

worldbench / awesome-3d-in-the-wild
View on GitHub
🌐 A Roadmap for 3D Scene Understanding in the Wild
☆23Dec 19, 2025Updated 2 months ago
worldbench / Calib3D
View on GitHub
[WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding
☆70Dec 6, 2025Updated 3 months ago
lixiaoyu2000 / HAT
View on GitHub
Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"
☆29Jan 13, 2026Updated last month
yuyang-cloud / X-Scene
View on GitHub
Implementation of paper "𝒳-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability"
☆32Jun 17, 2025Updated 8 months ago
BraveGroup / DriveVLA-W0
View on GitHub
DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving (ICLR 2026)
☆315Feb 11, 2026Updated 3 weeks ago
BraveGroup / LAW
View on GitHub
(ICLR2025) Enhancing End-to-End Autonomous Driving with Latent World Model
☆318Jun 29, 2025Updated 8 months ago
valeoai / PointBeV
View on GitHub
Official implementation of PointBeV: A Sparse Approach to BeV Predictions
☆139Mar 7, 2024Updated 2 years ago
autodriving-heart / NeurIPS2024-Papers-about-Autonomous-Driving
View on GitHub
NeurIPS2024-Papers-about-Autonomous-Driving
☆19Nov 18, 2024Updated last year
RenzKa / simlingo
View on GitHub
[CVPR 2025, Spotlight] SimLingo (CarLLava): Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment
☆363Aug 25, 2025Updated 6 months ago
V2AI / nuCraft_API
View on GitHub
High-res 3D Occupancy Dataset for Unified 3D Scene Understanding.
☆29Jul 14, 2024Updated last year
worldbench / SPIRAL
View on GitHub
[NeurIPS 2025] SPIRAL: Semantic-Aware Progressive LiDAR Scene Generation and Understanding
☆43Nov 30, 2025Updated 3 months ago
songw-zju / Scribble2Scene
View on GitHub
The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)
☆14Jul 27, 2024Updated last year
autodriving-heart / ECCV-2024-Papers-Autonomous-Driving
View on GitHub
ECCV 2024 Paper List about Autonomous Driving
☆125Oct 5, 2024Updated last year
worldbench / Robo3D
View on GitHub
[ICCV 2023] Robo3D: Towards Robust and Reliable 3D Perception against Corruptions
☆373Dec 6, 2025Updated 3 months ago
MasterHow / OneOcc
View on GitHub
An official implementation for "OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera"
☆29Nov 6, 2025Updated 4 months ago
valeoai / DrivoR
View on GitHub
[CVPR2026] DrivoR: an end-to-end driving model by driving on registers
☆137Mar 2, 2026Updated last week
VISION-SJTU / SparseOcc
View on GitHub
Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…
☆71Aug 5, 2024Updated last year
lucifer443 / RecurrentBEV
View on GitHub
[ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection
☆33Sep 28, 2024Updated last year
xiaomi-mlab / MindDrive
View on GitHub
Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning”
☆179Feb 12, 2026Updated 3 weeks ago
MSunDYY / SparseWorld
View on GitHub
[AAAI 2026] SparseWorld: A Flexible, Adaptive, and Efficient 4D Occupancy World Model Powered by Sparse and Dynamic Queries
☆40Jan 14, 2026Updated last month
placeforyiming / IROS21-FIDNet-SemanticKITTI
View on GitHub
An extremely simple, intuitive, hardware-friendly, and well-performing network structure for LiDAR semantic segmentation on 2D range imag…
☆72Sep 17, 2021Updated 4 years ago
zzj403 / BEV_Robust
View on GitHub
[CVPR 2023] Understanding the Robustness of 3D Object Detection With Bird's-Eye-View Representations in Autonomous Driving
☆31Apr 3, 2024Updated last year
chaytonmin / UniScene
View on GitHub
Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving
☆236Feb 15, 2024Updated 2 years ago
StudyingFuFu / L2COcc
View on GitHub
☆22Mar 18, 2025Updated 11 months ago
Public-BOTs / TiGDistill-BEV
View on GitHub
☆23Dec 31, 2024Updated last year
liyingyanUCAS / WoTE
View on GitHub
(ICCV2025) End-to-End Driving with Online Trajectory Evaluation via BEV World Model
☆200Jun 29, 2025Updated 8 months ago
songw-zju / PointLoRA
View on GitHub
The official implementation of "PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning" (CVPR 2025)
☆28Oct 31, 2025Updated 4 months ago
chenhaomingbob / CSC
View on GitHub
[CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale …
☆17Jun 11, 2024Updated last year
rolsheng / MM-VUFM4DS
View on GitHub
【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆51May 26, 2024Updated last year
TabGuigui / FipTR
View on GitHub
☆22Aug 3, 2024Updated last year
ucaszyp / World4Drive
View on GitHub
[ICCV 2025]
☆65Dec 31, 2025Updated 2 months ago
sbysbysbys / AFOV
View on GitHub
☆22Jan 22, 2025Updated last year
PKUHaoWang / EmbodiedOcc2
View on GitHub
[ACM MM 2025] EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler
☆26Aug 7, 2025Updated 7 months ago
ywyeli / UniDrive
View on GitHub
[ICLR'25] UniDrive: Towards Universal Driving Perception Across Camera Configurations
☆92Jul 2, 2025Updated 8 months ago
jbwang1997 / OPUS
View on GitHub
OPUS: Occupancy Prediction Using a Sparse Set
☆149Jan 5, 2026Updated 2 months ago
cdb342 / ALOcc
View on GitHub
[ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction
☆41Dec 1, 2025Updated 3 months ago
Daniel-xsy / BEV-Attack
View on GitHub
[TMLR 2024] On the Adversarial Robustness of Camera-based 3D Object Detection
☆31Apr 23, 2024Updated last year
chaytonmin / Awesome-Occupancy-Prediction-Autonomous-Driving
View on GitHub
Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy
☆259Jul 10, 2023Updated 2 years ago
PeidongLi / SSR
View on GitHub
[ICLR 2025] The official implementation of SSR
☆244Mar 23, 2025Updated 11 months ago