alibaba-damo-academy / RynnVLA-001Links
RynnVLA-001: A Vision-Language-Action Model Boosted by Generative Priors
☆42Updated this week
Alternatives and similar repositories for RynnVLA-001
Users that are interested in RynnVLA-001 are comparing it to the libraries listed below
Sorting:
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆119Updated 3 months ago
- Unified Vision-Language-Action Model☆170Updated 3 weeks ago
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆47Updated this week
- ICCV2025☆112Updated this week
- Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos☆126Updated this week
- ☆53Updated 7 months ago
- ☆82Updated 2 weeks ago
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆123Updated 2 weeks ago
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆58Updated 2 weeks ago
- WorldVLA: Towards Autoregressive Action World Model☆323Updated last month
- 🦾 A Dual-System VLA with System2 Thinking☆84Updated 3 weeks ago
- DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆143Updated last week
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆184Updated last month
- InternRobotics' open-source toolbox for vision-based embodied spatial intelligence.☆32Updated 2 weeks ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆72Updated 8 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆171Updated 2 months ago
- List of papers on video-centric robot learning☆21Updated 8 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆255Updated 2 weeks ago
- ☆55Updated 5 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆184Updated 3 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆42Updated 4 months ago
- official repo for AGNOSTOS, a cross-task manipulation benchmark, and X-ICM method, a cross-task in-context manipulation (VLA) method☆35Updated last month
- Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆37Updated last month
- ☆106Updated last month
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆96Updated 3 months ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [Actively Maintained🔥]☆122Updated last week
- Official Code For VLA-OS.☆78Updated last month
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆153Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆135Updated 4 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆60Updated 2 weeks ago