intuitive-robots / mdt_policy
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights
☆93Updated 4 months ago
Alternatives and similar repositories for mdt_policy:
Users that are interested in mdt_policy are comparing it to the libraries listed below
- A simple testbed for robotics manipulation policies☆75Updated 3 weeks ago
- Official implementation of GR-MG☆68Updated last month
- Reimplementation of GR-1, a generalized policy for robotics manipulation.☆115Updated 5 months ago
- ☆65Updated 3 months ago
- ☆72Updated 4 months ago
- ☆33Updated 2 months ago
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)☆67Updated 7 months ago
- A unified architecture for multimodal multi-task robotic policy learning.☆134Updated last year
- [ICRA'24] Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning☆62Updated 6 months ago
- Official Code Repo for GENIMA☆64Updated 4 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆72Updated this week
- Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning☆83Updated 7 months ago
- DROID Policy Learning and Evaluation☆164Updated last month
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"☆57Updated 4 months ago
- Cross-Embodiment Robot Learning Codebase☆41Updated 9 months ago
- [CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation☆148Updated 10 months ago
- ☆64Updated last month
- The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"☆60Updated 3 months ago
- code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation☆76Updated 6 months ago
- ☆95Updated last year
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆134Updated 5 months ago
- Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation☆74Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆75Updated last week
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆217Updated 9 months ago
- ☆59Updated this week
- A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks☆81Updated 2 weeks ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆104Updated 4 months ago
- ☆34Updated 9 months ago
- [CoRL 2024 Oral] Equivariant Diffusion Policy☆72Updated this week