juruobenruo / DexVLALinks
☆390Updated 2 months ago
Alternatives and similar repositories for DexVLA
Users that are interested in DexVLA are comparing it to the libraries listed below
Sorting:
- Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation☆125Updated 3 months ago
- RoboTwin 2.0 Offical Repo☆1,053Updated this week
- ☆620Updated 2 months ago
- This repository contains a collection of resources and papers on Diffusion Models for Robotic Manipulation.☆449Updated 2 weeks ago
- ☆336Updated last year
- Codebase for the BestMan Mobile Manipulator Platform☆312Updated 3 weeks ago
- 多模态具身智能大模型 OpenVLA 的复现以及在 LIBERO 数据集上的微调改进☆126Updated 3 months ago
- A Scalable and Hardware-Independent Universal Manipulation Interface☆76Updated 2 months ago
- [TRO 2024] Grasp, See and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior☆62Updated 2 months ago
- [CVPR 2025 Highlight] OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints☆122Updated 2 months ago
- ☆27Updated 10 months ago
- ☆154Updated last month
- Brain-Body Co-Design in Embodied Intelligence: Taxonomy, Frontiers, and Challenges☆176Updated last week
- DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping☆283Updated last week
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆290Updated 3 weeks ago
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.☆97Updated 3 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆239Updated last week
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆267Updated 2 months ago
- Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io☆207Updated last month
- ✨✨latest advancements in VLA models(VIsion Language Action)☆75Updated 2 months ago
- ☆363Updated 5 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆360Updated this week
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"☆68Updated 8 months ago
- ☆147Updated 3 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆116Updated last year
- [ICRA 2025] Official Implementation of "Robust Robot Walker: Learning Agile Locomotion over Tiny Traps"☆46Updated last month
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success☆481Updated last month
- ☆246Updated 5 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆191Updated last week
- [CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"☆177Updated 3 months ago