Mondo-Robotics / DiT4DiTView on GitHub
This is the official code repo for DiT4DiT, a Vision-Action-Model (VAM) framework that combines video generation model with flow-matching-based action prediction for generalizable robotic manipulation.
119Apr 16, 2026Updated this week

Alternatives and similar repositories for DiT4DiT

Users that are interested in DiT4DiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?