intuitive-robots / mdt_policyLinks
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights
☆143Updated 8 months ago
Alternatives and similar repositories for mdt_policy
Users that are interested in mdt_policy are comparing it to the libraries listed below
Sorting:
- A simple testbed for robotics manipulation policies☆93Updated 2 months ago
- This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".☆157Updated 2 weeks ago
- Official implementation of GR-MG☆81Updated 5 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆190Updated last week
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆137Updated 9 months ago
- ☆94Updated last month
- ☆63Updated 4 months ago
- GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data☆126Updated last month
- [CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"☆176Updated 2 months ago
- ☆78Updated 2 weeks ago
- Official codebase for "Any-point Trajectory Modeling for Policy Learning"☆225Updated 10 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆212Updated 3 months ago
- Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.☆127Updated last month
- RoboDual: Dual-System for Robotic Manipulation☆80Updated last month
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆259Updated last year
- ☆103Updated 8 months ago
- ☆142Updated 3 months ago
- Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets☆88Updated this week
- This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.☆105Updated this week
- Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning☆108Updated 3 months ago
- DROID Policy Learning and Evaluation☆199Updated 2 months ago
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation☆115Updated 6 months ago
- [RSS25] Official implementation of DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning☆160Updated 2 months ago
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction☆81Updated 3 months ago
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model☆238Updated last week
- [ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"☆80Updated last month
- Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"☆324Updated 10 months ago
- ☆187Updated last year
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface☆128Updated 8 months ago
- ☆114Updated 2 years ago