Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆79Dec 12, 2024Updated last year
Alternatives and similar repositories for FLIP
Users that are interested in FLIP are comparing it to the libraries listed below
Sorting:
- [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos☆164Oct 1, 2025Updated 5 months ago
- Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation☆90Jul 21, 2025Updated 7 months ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆122May 8, 2025Updated 9 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆280Jul 8, 2025Updated 7 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆338Jul 23, 2025Updated 7 months ago
- HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction☆41Sep 15, 2025Updated 5 months ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP☆73Jan 14, 2026Updated last month
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆106Nov 21, 2024Updated last year
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆189Oct 8, 2025Updated 4 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆70Dec 20, 2024Updated last year
- ☆34Mar 11, 2025Updated 11 months ago
- Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"☆21Jul 4, 2023Updated 2 years ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆28Sep 28, 2025Updated 5 months ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arxiv 2024☆36Jan 22, 2025Updated last year
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆93Jun 6, 2025Updated 8 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆248Apr 25, 2024Updated last year
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆99Jul 31, 2024Updated last year
- [ECCV 2024, Oral] UGG: Unified Generative Grasping☆55Apr 7, 2025Updated 10 months ago
- Author's implementation of DemoDiffusion.☆61Jan 14, 2026Updated last month
- ☆94Sep 4, 2024Updated last year
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆343Mar 19, 2025Updated 11 months ago
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆26Sep 25, 2025Updated 5 months ago
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success☆1,051Sep 9, 2025Updated 5 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆1,262Oct 17, 2025Updated 4 months ago
- Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)☆197Feb 28, 2024Updated 2 years ago
- ☆78May 23, 2025Updated 9 months ago
- ☆80Oct 21, 2024Updated last year
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆472Jan 22, 2025Updated last year
- Official Implementation of the paper RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation☆39Jan 13, 2026Updated last month
- [CoRL 2025 Best Paper Award] Fabrica: Dual-Arm Assembly of General Multi-Part Objects via Integrated Planning and Learning☆64Jan 11, 2026Updated last month
- [CVPR 2024 Highlight] Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3) for Visual Robotic Manipulation☆59Apr 5, 2024Updated last year
- speed-running solving robot manipulation tasks☆24Oct 31, 2024Updated last year
- [TASE 2025] Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter☆35Oct 27, 2025Updated 4 months ago
- Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action.☆48Jan 4, 2025Updated last year
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆273Jun 19, 2025Updated 8 months ago
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆384Aug 17, 2024Updated last year
- StructDiffusion: Language-Guided Creation of Physically-Valid Structures using Unseen Objects☆59Jul 10, 2023Updated 2 years ago
- ☆29Dec 9, 2025Updated 2 months ago
- Official code repository of paper "D(R, O) Grasp: A Unified Representation of Robot and Object Interaction for Cross-Embodiment Dexterous…☆261Nov 13, 2025Updated 3 months ago