Kiteretsu77 / This_and_That_VDM
This is the official implementation of Video Generation part of This&That: Language-Gesture Controlled Video Generation for Robot Planning (ICRA 2025)
☆34Updated last month
Alternatives and similar repositories for This_and_That_VDM:
Users that are interested in This_and_That_VDM are comparing it to the libraries listed below
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆48Updated 3 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆44Updated 2 months ago
- ☆67Updated 6 months ago
- ☆94Updated 7 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆47Updated 3 months ago
- List of papers on video-centric robot learning☆14Updated 4 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆44Updated 9 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆55Updated 2 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆68Updated 5 months ago
- main augmentation script for real world robot dataset.☆35Updated last year
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆80Updated 7 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆52Updated 2 weeks ago
- ☆18Updated 8 months ago
- Unified Video Action Model☆123Updated this week
- Official implementation of "Re3Sim: Generating High-Fidelity Simulation Data via 3D-Photorealistic Real-to-Sim for Robotic Manipulation"☆75Updated last week
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆138Updated 4 months ago
- ☆56Updated last week
- ☆84Updated 2 weeks ago
- ☆46Updated last week
- ☆37Updated this week
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆109Updated 5 months ago
- AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆66Updated 2 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆38Updated 3 months ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆17Updated 2 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆107Updated last week
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆96Updated 4 months ago
- Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression☆33Updated last month
- ☆32Updated 2 weeks ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆112Updated 2 weeks ago
- ☆46Updated 3 months ago