abliao / RoBridge
[ICCV2025] RoBridge: A Hierarchical Architecture Bridging Cognition and Execution for General Robotic Manipulation
☆29 Updated last month
Alternatives and similar repositories for RoBridge
Users who are interested in RoBridge are comparing it to the repositories listed below
- ☆80 Updated last month
- ☆55 Updated 6 months ago
- Official Code For VLA-OS. ☆101 Updated 2 months ago
- ☆58 Updated last month
- The Official Implementation of RoboMatrix ☆96 Updated 3 months ago
- ☆64 Updated 6 months ago
- ICCV2025 ☆114 Updated last week
- ☆81 Updated 3 months ago
- ☆55 Updated 7 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning" ☆178 Updated 3 months ago
- ☆28 Updated last month
- Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning ☆89 Updated last month
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation ☆45 Updated 4 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆118 Updated 10 months ago
- Galaxea's first VLA release ☆158 Updated this week
- DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge ☆158 Updated last week
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆123 Updated 2 months ago
- ✨✨ Official implementation of BridgeVLA ☆120 Updated 2 months ago
- A comprehensive list of papers about dual-system VLA models, including papers, codes, and related websites. ☆69 Updated last month
- 🦾 A Dual-System VLA with System2 Thinking ☆99 Updated last week
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆73 Updated 3 months ago
- [IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models ☆97 Updated last year
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model ☆277 Updated 2 months ago
- Official implementation of the paper: Task Reconstruction and Extrapolation for $\pi_0$ using Text Latent (https://arxiv.org/pdf/2505.035… ☆67 Updated 3 weeks ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre… ☆151 Updated 10 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025) ☆25 Updated 3 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction ☆113 Updated 2 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆138 Updated 4 months ago
- GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data ☆207 Updated last month
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥] ☆134 Updated this week