video-to-action / video-to-action-release
Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".
☆41Updated last month
Alternatives and similar repositories for video-to-action-release:
Users that are interested in video-to-action-release are comparing it to the libraries listed below
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 7 months ago
- ☆61Updated 5 months ago
- Code for Stable Control Representations☆23Updated last month
- ☆91Updated 6 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆41Updated 2 months ago
- ☆21Updated 3 weeks ago
- G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆34Updated 2 weeks ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆24Updated last month
- List of papers on video-centric robot learning☆14Updated 3 months ago
- ☆73Updated 5 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆91Updated 7 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated 2 months ago
- This is the official implementation of Video Generation part of This&That: Language-Gesture Controlled Video Generation for Robot Plannin…☆28Updated last week
- main augmentation script for real world robot dataset.☆34Updated last year
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆77Updated 6 months ago
- ☆43Updated 2 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆76Updated 2 weeks ago
- ☆18Updated 7 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 3 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 3 weeks ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆63Updated 4 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation☆72Updated last week
- ☆61Updated 3 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆46Updated 4 months ago
- Official implementation of "Self-Improving Video Generation"☆60Updated last month
- [ICCV 2023] Understanding 3D Object Interaction from a Single Image☆41Updated 11 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆39Updated last year
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆114Updated last month