roboterax / video-prediction-policy
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆259Updated 3 months ago
Alternatives and similar repositories for video-prediction-policy
Users who are interested in video-prediction-policy are comparing it to the libraries listed below
- [CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"☆199Updated 5 months ago
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.☆274Updated this week
- ☆394Updated 7 months ago
- Building General-Purpose Robots Based on Embodied Foundation Model☆265Updated this week
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆283Updated 2 months ago
- WorldVLA: Towards Autoregressive Action World Model☆384Updated 2 weeks ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆277Updated last year
- Galaxea's first VLA release☆215Updated this week
- [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.☆306Updated 3 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆482Updated 2 months ago
- DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping☆366Updated last month
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆225Updated 2 months ago
- A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation☆341Updated 3 months ago
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆183Updated 3 months ago
- OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation☆259Updated 2 weeks ago
- ☆252Updated this week
- GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data☆215Updated last month
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions☆723Updated 3 weeks ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆285Updated last month
- Official Code For VLA-OS.☆105Updated 2 months ago
- ICCV2025☆125Updated 2 weeks ago
- Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"☆187Updated 10 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆261Updated last month
- The Simulation Framework from AgiBot☆276Updated 3 weeks ago
- [RSS25] Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning☆198Updated last month
- H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation☆80Updated last week
- This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".☆264Updated 3 months ago
- PyTorch PI-zero and PI-zero-fast. Adapted from LeRobot☆119Updated last week
- Embodied Chain of Thought: A robotic policy that reasons to solve the task.☆303Updated 5 months ago
- ☆142Updated 3 weeks ago