video-language-planning / vlp_code
☆76Updated 8 months ago
Alternatives and similar repositories for vlp_code:
Users that are interested in vlp_code are comparing it to the libraries listed below
- ☆68Updated 7 months ago
- Codebase for HiP☆89Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆44Updated 3 months ago
- ☆46Updated 4 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆61Updated last week
- ☆44Updated last year
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆58Updated 4 months ago
- ☆65Updated 6 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆168Updated last month
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆209Updated 11 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆53Updated 7 months ago
- Code for subgoal synthesis via image editing☆133Updated last year
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆61Updated 6 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated 3 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆97Updated 9 months ago
- ☆67Updated 6 months ago
- ☆98Updated 8 months ago
- ☆24Updated last year
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024☆70Updated 3 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 9 months ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆27Updated last month
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆129Updated last year
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction☆69Updated last month
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆32Updated last month
- ☆30Updated 3 weeks ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆84Updated 8 months ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆91Updated 2 years ago
- ☆68Updated last week
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆110Updated 2 weeks ago
- Official PyTorch implementation of AdaFlow☆51Updated 5 months ago