intuitive-robots/mdt_policy

[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

☆168

Alternatives and similar repositories for mdt_policy

Users who are interested in mdt_policy are comparing it to the repositories listed below.

bytedance / GR-MG
Official implementation of GR-MG
☆93 · Jan 12, 2025 · Updated last year
EDiRobotics / GR1-Training
Reimplementation of GR-1, a generalized policy for robotics manipulation.
☆147 · Sep 4, 2024 · Updated last year
intuitive-robots / MoDE_Diffusion_Policy
[ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"
☆117 · May 16, 2025 · Updated 9 months ago
EDiRobotics / mimictest
A simple testbed for robotics manipulation policies
☆103 · Apr 13, 2025 · Updated 10 months ago
bytedance / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆300 · Apr 22, 2024 · Updated last year
nickgkan / 3d_diffuser_actor
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
☆384 · Aug 17, 2024 · Updated last year
intuitive-robots / beso
[RSS 2023] Official code for "Goal Conditioned Imitation Learning using Score-based Diffusion Policies"
☆89 · Dec 1, 2023 · Updated 2 years ago
intuitive-robots / NILS
[CoRL 2024] Official code for "Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models"
☆28 · Dec 11, 2024 · Updated last year
ManiCM-fast / ManiCM
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
☆122 · May 8, 2025 · Updated 9 months ago
Robot-VLAs / RoboVLMs
☆443 · Nov 29, 2025 · Updated 3 months ago
jayLEE0301 / vq_bet_official
Official code for "Behavior Generation with Latent Actions" (ICML 2024 Spotlight)
☆197 · Feb 28, 2024 · Updated 2 years ago
InternRobotics / Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆280 · Jul 8, 2025 · Updated 7 months ago
Large-Trajectory-Model / ATM
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆273 · Jun 19, 2025 · Updated 8 months ago
mees / calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆841 · Sep 8, 2025 · Updated 5 months ago
simpler-env / SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆980 · Dec 20, 2025 · Updated 2 months ago
octo-models / octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
☆1,552 · Jul 31, 2024 · Updated last year
Nimolty / RoboKeyGen
☆19 · Jul 7, 2024 · Updated last year
intuitive-robots / flower_vla_calvin
[CoRL 25] Code for FLOWER VLA for finetuning FLOWER on CALVIN and all LIBERO environments
☆77 · Sep 22, 2025 · Updated 5 months ago
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reasons to solve the task.
☆369 · Apr 5, 2025 · Updated 10 months ago
thu-ml / RoboticsDiffusionTransformer
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
☆1,625 · Jan 21, 2026 · Updated last month
flow-diffusion / AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆248 · Apr 25, 2024 · Updated last year
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆164 · Oct 1, 2025 · Updated 5 months ago
MohitShridhar / genima
Official Code Repo for GENIMA
☆77 · Oct 29, 2025 · Updated 4 months ago
kpertsch / rlds_dataset_mod
Efficiently apply modification functions to RLDS/TFDS datasets.
☆41 · Jun 5, 2024 · Updated last year
SudeepDasari / dit-policy
☆144 · Oct 15, 2024 · Updated last year
robot-colosseum / robot-colosseum
A Benchmark for Evaluating Generalization for Robotic Manipulation
☆146 · Mar 3, 2025 · Updated last year
cvlab-columbia / dreamitate
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)
☆58 · Jun 7, 2025 · Updated 8 months ago
Aaditya-Prasad / consistency-policy
[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
☆197 · Jul 20, 2024 · Updated last year
dyson-ai / hdp
[CVPR 2024] Hierarchical Diffusion Policy for Multi-Task Robotic Manipulation
☆228 · Apr 9, 2024 · Updated last year
Dantong88 / LLARVA
☆62 · Dec 14, 2024 · Updated last year
RoboFlamingo / RoboFlamingo
Code for RoboFlamingo
☆424 · May 8, 2024 · Updated last year
intuitive-robots / vdd
[NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts"
☆17 · Dec 7, 2024 · Updated last year
siddhanthaldar / BAKU
Code for BAKU: An Efficient Transformer for Multi-Task Policy Learning
☆129 · Mar 16, 2025 · Updated 11 months ago
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆657 · Jun 23, 2025 · Updated 8 months ago
HeegerGao / FLIP
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
☆79 · Dec 12, 2024 · Updated last year
Lifelong-Robot-Learning / LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆1,517 · Mar 15, 2025 · Updated 11 months ago
Tavish9 / any4lerobot
🎁 A collection of utilities for LeRobot.
☆873 · Feb 7, 2026 · Updated 3 weeks ago
juruobenruo / DexVLA
☆41 · Apr 15, 2025 · Updated 10 months ago
moojink / openvla-oft
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,051 · Sep 9, 2025 · Updated 5 months ago