LiAutoAD / LightVLALinks
LightVLA
☆62Updated 2 weeks ago
Alternatives and similar repositories for LightVLA
Users that are interested in LightVLA are comparing it to the libraries listed below
Sorting:
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆69Updated 3 weeks ago
- Latest Advances on Embodied Multimodal LLMs (or Vison-Language-Action Models).☆121Updated last year
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t …☆114Updated last year
- ☆19Updated 2 months ago
- ☆85Updated 6 months ago
- Nav-R1: Reasoning and Navigation in Embodied Scenes☆74Updated last month
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆202Updated last month
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆73Updated 2 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆69Updated last year
- Simulator designed to generate diverse driving scenarios.☆43Updated 9 months ago
- [TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf…☆52Updated last week
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆67Updated 2 months ago
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆29Updated last year
- ☆89Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆66Updated 8 months ago
- ☆51Updated 3 weeks ago
- MiMo-Embodied☆260Updated last week
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆173Updated last month
- Official implementation of "From Forecasting to Planning: Policy World Model for Collaborative State-Action Prediction"☆34Updated this week
- ☆21Updated 3 months ago
- This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied…☆64Updated 3 weeks ago
- [CVPR 2024] On the Road to Portability: Compressing End-to-End Motion Planner for Autonomous Driving☆148Updated last year
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆44Updated 2 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆147Updated last month
- Latest Advances on Vison-Language-Action Models.☆119Updated 8 months ago
- 【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆50Updated last year
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆231Updated 2 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆84Updated 9 months ago
- [Communication in Transprotation Reasearch] Official PyTorch Implementation of ''GPT-4 enhanced multimodal grounding for autonomous driv…☆25Updated last year
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆95Updated 11 months ago