OpenHelix-Team / VLA-AdapterLinks
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
☆1,723Updated last week
Alternatives and similar repositories for VLA-Adapter
Users that are interested in VLA-Adapter are comparing it to the libraries listed below
Sorting:
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆731Updated 3 months ago
- Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation☆160Updated 4 months ago
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆86Updated this week
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human …☆362Updated last month
- RoboTwin 2.0 Offical Repo☆1,694Updated this week
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆103Updated this week
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior☆71Updated 8 months ago
- Codebase for the BestMan Mobile Manipulator Platform☆324Updated 5 months ago
- ☆34Updated last year
- CoNav : Collaborative Cross-Modal Reasoning for Embodied Navigation☆17Updated 6 months ago
- RynnEC: Bringing MLLMs into Embodied World☆380Updated last month
- This repository serves as a central navigator for the various components of my Final Year Project (FYP).☆22Updated 5 months ago
- ☆979Updated last month
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆170Updated this week
- ☆118Updated last week
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆68Updated this week
- ☆545Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆222Updated this week
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆164Updated 3 weeks ago
- Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"☆137Updated this week
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆252Updated last month
- [ICML 2025 Poster] Official PyTorch Implementation of "Habitizing Diffusion Planning for Efficient and Effective Decision Making"☆35Updated 6 months ago
- ☆93Updated 4 months ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆132Updated this week
- FreeTacMan: Robot-free Visuo-Tactile Data Collection System for Contact-rich Manipulation☆104Updated 3 weeks ago
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.☆101Updated 8 months ago
- Any-step Dynamics Model for Policy Optimization☆64Updated 9 months ago
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆217Updated last month
- ☆247Updated 10 months ago
- The code of paper "LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning" accepted by ICLR'25☆139Updated last month