Kiteretsu77 / This_and_That_VDM
This is the official implementation of Video Generation part of This&That: Language-Gesture Controlled Video Generation for Robot Planning
☆15Updated last week
Related projects ⓘ
Alternatives and complementary repositories for This_and_That_VDM
- ☆16Updated 4 months ago
- ☆29Updated 2 weeks ago
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"☆52Updated last month
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆41Updated 4 months ago
- ☆76Updated 2 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆37Updated 7 months ago
- ☆36Updated last week
- main augmentation script for real world robot dataset.☆31Updated last year
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆58Updated 5 months ago
- ☆43Updated 2 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆77Updated 4 months ago
- A Vision-Language Model for Spatial Affordance Prediction in Robotics☆51Updated 3 weeks ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆88Updated last month
- Mirage: a zero-shot cross-embodiment policy transfer method. Benchmarking code for cross-embodiment policy transfer.☆15Updated 6 months ago
- [RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre…☆63Updated 3 weeks ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆60Updated 3 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆68Updated last week
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation☆46Updated 2 weeks ago
- Code for subgoal synthesis via image editing☆112Updated last year
- [RSS 2024] Learning Manipulation by Predicting Interaction☆89Updated 2 months ago
- The official code of our ICRA'24 paper Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆58Updated 3 months ago
- The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"☆44Updated last week
- ☆68Updated 2 months ago
- [CoRL2023] Official PyTorch implementation of PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation☆32Updated 5 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆43Updated 3 months ago
- ☆36Updated 3 weeks ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆10Updated 2 months ago
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation☆80Updated last month
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆81Updated 3 months ago
- ☆57Updated 3 weeks ago