Benchmark and model for step-by-step reasoning in autonomous driving.
☆71 Mar 15, 2025 Updated last year
Alternatives and similar repositories for DriveLMM-o1
Users that are interested in DriveLMM-o1 are comparing it to the libraries listed below.
- ☆72 Aug 12, 2024 Updated last year
- Official PyTorch implementation of CODA-LM (https://arxiv.org/abs/2404.10595) ☆101 Dec 5, 2024 Updated last year
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving" ☆31 May 18, 2025 Updated 11 months ago
- ☆596 Mar 3, 2026 Updated last month
- Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning ☆323 Mar 26, 2025 Updated last year
- Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking ☆33 Mar 14, 2025 Updated last year
- CoRL 2025 ☆47 Sep 20, 2025 Updated 6 months ago
- SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning ☆66 Feb 1, 2026 Updated 2 months ago
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models ☆23 Apr 16, 2025 Updated last year
- ☆48 Jun 13, 2025 Updated 10 months ago
- [ECCV 2024 Oral] DriveLM: Driving with Graph Visual Question Answering ☆1,297 Jul 2, 2025 Updated 9 months ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology ☆12 Jun 17, 2025 Updated 10 months ago
- [ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving ☆100 Jan 1, 2024 Updated 2 years ago
- [ECCV 2024] Embodied Understanding of Driving Scenarios ☆209 Jul 2, 2025 Updated 9 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models ☆80 Sep 23, 2025 Updated 6 months ago
- A new multi-task learning framework using Vision Transformers ☆11 Jun 19, 2024 Updated last year
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo… ☆17 Jun 12, 2024 Updated last year
- ☆415 Aug 30, 2024 Updated last year
- ☆72 Aug 17, 2025 Updated 8 months ago
- OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model. ☆920 May 13, 2025 Updated 11 months ago
- [CVPR'26] LEAD: Minimizing Learner–Expert Asymmetry in End-to-End Driving ☆152 Updated this week
- ☆11 Oct 29, 2024 Updated last year
- A collection of VLMs papers, blogs, and projects, with a focus on VLMs in Autonomous Driving and related reasoning techniques. ☆11 Nov 16, 2024 Updated last year
- [CVPR 2024] MAPLM: A Large-Scale Vision-Language Dataset for Map and Traffic Scene Understanding ☆164 Nov 20, 2023 Updated 2 years ago
- [IROS 2024] Official implementation of paper: DriVLMe: "Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experience… ☆56 Nov 16, 2024 Updated last year
- [AAAI 2025] Language Prompt for Autonomous Driving ☆157 Sep 22, 2025 Updated 6 months ago
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes' ☆27 Oct 10, 2024 Updated last year
- [AAAI 2026] OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model ☆694 Feb 16, 2026 Updated 2 months ago
- [CoRL '25] Pseudo-Simulation for Autonomous Driving; [NeurIPS '24] NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Ben… ☆944 Oct 27, 2025 Updated 5 months ago
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues" (ICLR 2023) ☆20 Aug 24, 2023 Updated 2 years ago
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models ☆15 Nov 1, 2024 Updated last year
- [NeurIPS 2025] Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution ☆58 Feb 4, 2026 Updated 2 months ago
- ICLR 2026: Agent-X Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks ☆39 Apr 5, 2026 Updated 2 weeks ago
- [ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving" ☆207 Sep 26, 2024 Updated last year
- ☆259 Jun 18, 2024 Updated last year
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024] ☆22 Oct 27, 2024 Updated last year
- Bridging Large Vision-Language Models and End-to-End Autonomous Driving ☆545 Mar 15, 2026 Updated last month
- This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023. ☆422 Jun 11, 2024 Updated last year
- [TIV 2024] PolarPoint-BEV: Bird-Eye-View Perception in Polar Points for Explainable End-to-End Autonomous Driving ☆23 Mar 28, 2026 Updated 3 weeks ago