video-to-action / video-to-action-release
Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".
☆43Updated 2 months ago
Alternatives and similar repositories for video-to-action-release:
Users that are interested in video-to-action-release are comparing it to the libraries listed below
- ☆67Updated 6 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆48Updated 3 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 8 months ago
- Code for Stable Control Representations☆24Updated 2 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆26Updated 2 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆47Updated 3 months ago
- ☆94Updated 7 months ago
- ☆75Updated 7 months ago
- List of papers on video-centric robot learning☆14Updated 4 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆94Updated 8 months ago
- ☆46Updated 3 months ago
- main augmentation script for real world robot dataset.☆35Updated last year
- ☆42Updated 10 months ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆80Updated 7 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 2 months ago
- ☆28Updated 2 weeks ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆51Updated 2 weeks ago
- Latent Motion Token as the Bridging Language for Robot Manipulation☆77Updated this week
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated 5 months ago
- ☆62Updated 5 months ago
- ☆21Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆42Updated last year
- ☆19Updated 7 months ago
- Code Repository for "Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations"☆49Updated 3 months ago
- ☆18Updated 8 months ago
- Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer.☆20Updated 10 months ago