video-to-action / v2a-video-model-release
☆8Updated 3 months ago
Alternatives and similar repositories for v2a-video-model-release:
Users that are interested in v2a-video-model-release are comparing it to the libraries listed below
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration" (ICLR 2025 Spotlight).☆44Updated 3 months ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆27Updated last month
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆58Updated 4 months ago
- Official PyTorch implementation of AdaFlow☆51Updated 5 months ago
- The official codebase for running the experiments described in the AVDC paper.☆16Updated 6 months ago
- Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer.☆20Updated 11 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 10 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆52Updated 4 months ago
- Official Implementation of the paper RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation☆30Updated last month
- Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression☆37Updated 2 months ago
- ☆30Updated 3 weeks ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆97Updated 9 months ago
- main augmentation script for real world robot dataset.☆35Updated last year
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆84Updated 8 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆61Updated last week
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆28Updated 4 months ago
- [ICLR25] BID-Robot☆39Updated 5 months ago
- The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…☆23Updated last year
- ☆26Updated last month
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 3 months ago
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…☆72Updated 3 months ago
- ☆37Updated 4 months ago
- Hand-object interaction Pretraining From Videos☆85Updated 5 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆53Updated 7 months ago
- Cross-Embodiment Robot Learning Codebase☆44Updated last year
- ☆67Updated 6 months ago
- ☆48Updated 2 months ago
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Updated last year
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆19Updated 3 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆42Updated last year