video-language-planning / vlp_code
☆73Updated 6 months ago
Alternatives and similar repositories for vlp_code:
Users that are interested in vlp_code are comparing it to the libraries listed below
- ☆63Updated 5 months ago
- Codebase for HiP☆88Updated last year
- ☆61Updated 4 months ago
- ☆43Updated last year
- ☆22Updated last month
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆42Updated 2 months ago
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆46Updated 5 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆157Updated last month
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆28Updated 10 months ago
- ☆44Updated 2 months ago
- Code for subgoal synthesis via image editing☆125Updated last year
- ☆91Updated 6 months ago
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks☆86Updated last week
- The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"☆61Updated 4 months ago
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024☆64Updated last month
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆27Updated last month
- ☆65Updated 4 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆69Updated last month
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆201Updated 10 months ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆87Updated last year
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 8 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆86Updated last month
- Code Repository for "Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations"☆47Updated 2 months ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆78Updated 7 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆45Updated 2 months ago
- [ICCV 2023] Official code repository for ARNOLD benchmark☆152Updated 11 months ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆76Updated 5 months ago