LiAutoAD / LightVLA
LightVLA
☆66Updated last week
Alternatives and similar repositories for LightVLA
Users who are interested in LightVLA are comparing it to the repositories listed below.
- ☆20Updated 2 months ago
- Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models).☆122Updated last year
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆115Updated last year
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆69Updated 3 months ago
- Benchmark and model for step-by-step reasoning in autonomous driving.☆67Updated 9 months ago
- MiMo-Embodied☆304Updated last month
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆209Updated this week
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆173Updated last month
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆74Updated 2 months ago
- ☆87Updated 7 months ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆70Updated last month
- ☆57Updated last week
- Nav-R1: Reasoning and Navigation in Embodied Scenes☆83Updated last month
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆70Updated last year
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆325Updated 2 months ago
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆29Updated last year
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆158Updated last month
- Simulator designed to generate diverse driving scenarios.☆43Updated 9 months ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"☆101Updated 6 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆251Updated 3 months ago
- ☆15Updated last year
- Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning”☆69Updated this week
- [TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf…☆52Updated last month
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆54Updated 5 months ago
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning☆107Updated 2 months ago
- [ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.☆49Updated 3 months ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆312Updated this week
- Official repo for IRL-VLA☆69Updated 4 months ago
- Adapting VLMs to Bench2Drive.☆171Updated 2 months ago
- [ACM CSUR 2025] Understanding World or Predicting Future? A Comprehensive Survey of World Models☆300Updated last month