Mondo-Robotics/DiT4DiT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Mondo-Robotics/DiT4DiT)

Mondo-Robotics / DiT4DiT

This is the official code repo for DiT4DiT, a Vision-Action-Model (VAM) framework that combines video generation model with flow-matching-based action prediction for generalizable robotic manipulation.

☆412

Alternatives and similar repositories for DiT4DiT

Users that are interested in DiT4DiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NVlabs / GRAIL
View on GitHub
A digital data-generation pipeline that synthesizes humanoid loco-manipulation data from 3D assets and video priors.
☆433Updated this week
GalaxyGeneralRobotics / Humanoid-GPT
View on GitHub
[CVPR 2026] Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking
☆401Jul 22, 2026Updated last week
Renforce-Dynamics / MultiModalWBC
View on GitHub
MultiModalWBC is a fully open-source, IsaacLab-based framework for multi-modal whole-body control, designed for motion imitation, motion …
☆186Jun 16, 2026Updated last month
physical-superintelligence-lab / SIMPLE
View on GitHub
Welcome to SIMPLE, a full-stack simulation environment for humanoid loco-manipulation, built on AMO/SONIC, with integrated support for ma…
☆196Jul 21, 2026Updated last week
yuantianyuan01 / FastWAM
View on GitHub
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,220Apr 3, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Perkins729 / OmniXtreme
View on GitHub
☆713Apr 22, 2026Updated 3 months ago
OpenDriveLab / EgoHumanoid
View on GitHub
[RSS 2026] The first framework enabling humanoid robots to learn whole-body loco-manipulation from egocentric human demos
☆217Jun 6, 2026Updated last month
LeCAR-Lab / HDMI
View on GitHub
☆630Jan 17, 2026Updated 6 months ago
physical-superintelligence-lab / Psi0
View on GitHub
[RSS26'] Welcome to Psi-Zero, a Humanoid VLA towards Universal Humanoid Intelligence.
☆2,736Jul 17, 2026Updated last week
NVlabs / GR00T-VisualSim2Real
View on GitHub
GR00T-VisualSim2Real: Open-source sim-to-real framework for humanoid visual loco-manipulation. Train in simulation, deploy zero-shot on r…
☆363Apr 20, 2026Updated 3 months ago
LeCAR-Lab / BFM-Zero
View on GitHub
BFM_Zero: A Promptable Behavioral Foundation Model for Humanoid Control Using Unsupervised Reinforcement Learning
☆698Jul 15, 2026Updated 2 weeks ago
Axellwppr / motion_tracking
View on GitHub
☆369Jul 13, 2026Updated 2 weeks ago
InternRobotics / PhysHSI
View on GitHub
Official implementation of the paper: "PhysHSI: Towards a Real-World Generalizable and Natural Humanoid-Scene Interaction System"
☆306Oct 14, 2025Updated 9 months ago
BeingBeyond / Being-H
View on GitHub
Being-H is BeingBeyond's family of human-centric embodied foundation models.
☆1,107Jun 16, 2026Updated last month
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
TeleHuman / OASIS
View on GitHub
OASIS: From Simulation Data Collection to Real-World Humanoid Loco-Manipulation
☆60Jun 15, 2026Updated last month
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,500Apr 19, 2026Updated 3 months ago
zengweishuai / ScaleBFM
View on GitHub
The official implementation of the paper "Scaling Behavior Foundation Model for Humanoid Robots"
☆143Updated this week
Robbyant / lingbot-va
View on GitHub
[RSS 2026] Causal video-action world model for generalist robot control
☆1,691Jul 9, 2026Updated 2 weeks ago
Tsinghua-MARS-Lab / OMG
View on GitHub
Official repository for "OMG: Omni-Modal Motion Generation for Generalist Humanoid Control", https://arxiv.org/abs/2606.10340.
☆95Jul 22, 2026Updated last week
jieyefriic / nbcraft
View on GitHub
Multi-backend image and video generation CLI — Gemini, DashScope (wan), Volcengine Ark (Seedream/Seedance), OpenAI gpt-image-2.
☆155Apr 30, 2026Updated 2 months ago
NVlabs / GR00T-WholeBodyControl
View on GitHub
Welcome to GR00T Whole-Body Control (WBC)! This is a unified platform for developing and deploying advanced humanoid controllers. This in…
☆3,015Jul 17, 2026Updated last week
yuelinou999 / idempotent-pipeline-demo
View on GitHub
A demo project for idempotent pipeline design, duplicate prevention, and retry-safe processing.
☆236Mar 20, 2026Updated 4 months ago
next-1688 / 1688-source-suppliers
View on GitHub
1688找供应商 —— 结合用户需求与关键字查询对应的供应商及工厂信息核心工具能力：1688供应商查询能力。用于查询1688平台上的供应商及工厂信息。触发词：找供应商、查供应商、1688供应商、供应商信息、工厂信息、产业带查询。不触发场景：找商品/选品 → 1688-…
☆552May 7, 2026Updated 2 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
OpenMOSS / FRoM-W1
View on GitHub
[arXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
☆182Jun 5, 2026Updated last month
Axellwppr / gentle-humanoid
View on GitHub
GentleHumanoid: Whole body Motion Tracking with Compliance - Inference and Deploy
☆210Dec 17, 2025Updated 7 months ago
next-1688 / 1688-88syt
View on GitHub
88生意通是1688线下B2B交易的得力帮手，一句话搞定全流程操作！无论您是卖家还是买家，只需一句指令，即可轻松完成交易单创建、签署、确认收货、退款等核心操作，全面支持账号状态查询、实名认证、绑卡及交易，让每一步交易流程更清晰、更可控。通过智能化交互，实现交易流程数字化，提…
☆575Mar 27, 2026Updated 4 months ago
HorizonRobotics / HoloMotion
View on GitHub
HoloMotion: A Foundation Model for Whole-Body Humanoid Control
☆604Jul 20, 2026Updated last week
HybridRobotics / Ego-VCP
View on GitHub
Ego-Vision World Model for Humanoid Contact Planning
☆188Dec 24, 2025Updated 7 months ago
Ingrid789 / OmniContact_sim2sim
View on GitHub
Official implementation of the paper: "OmniContact: Chaining Meta-Skills via Contact Flow for Generalizable Humanoid Loco-Manipulation"
☆131Jul 9, 2026Updated 2 weeks ago
mimic-video / mimic-video
View on GitHub
Video-Action Models for Generalizable Robot Control Beyond VLAs
☆293Jun 26, 2026Updated last month
NVIDIA / soma-retargeter
View on GitHub
SOMA BVH to humanoid robot motion retargeting library built with Newton and NVIDIA Warp
☆504Mar 25, 2026Updated 4 months ago
hanlin-afk / rt-cinfer-web
View on GitHub
Real-time causal inference framework for Web Vitals optimization using streaming SCMs, public CrUX/PageSpeed field data, and auditable in…
☆400Jul 2, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
project-instinct / InstinctLab
View on GitHub
☆736Jul 8, 2026Updated 3 weeks ago
starVLA / starVLA
View on GitHub
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆3,316Jul 20, 2026Updated last week
OpenDriveLab / WholebodyVLA
View on GitHub
[ICLR 2026] Towards Unified Latent VLA for Whole-body Loco-manipulation Control
☆530May 25, 2026Updated 2 months ago
amazon-far / TWIST2
View on GitHub
[arXiv 2025] TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System
☆858Dec 3, 2025Updated 7 months ago
zixuan417 / humanoid-general-motion-tracking
View on GitHub
The official codebase of paper "GMT: General Motion Tracking for Humanoid Whole-Body Control"
☆430Aug 8, 2025Updated 11 months ago
LeCAR-Lab / FALCON
View on GitHub
[L4DC 2026 (Oral)] "FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation"
☆409Apr 9, 2026Updated 3 months ago
jiyuanwang-afk / Explainable-AI-in-Financial-Fraud-and-Anomaly-Detection
View on GitHub
This repository explores explainable deep learning models for financial fraud detection and anomaly analysis. It integrates Graph Neura…
☆700Oct 20, 2025Updated 9 months ago