RoboDita/Dita

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RoboDita/Dita)

RoboDita / Dita

ICCV2025

☆171

Alternatives and similar repositories for Dita

Users that are interested in Dita are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PKU-HMI-Lab / Hybrid-VLA
View on GitHub
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆352Oct 3, 2025Updated 9 months ago
OpenDriveLab / UniVLA
View on GitHub
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,114Nov 19, 2025Updated 8 months ago
SpatialVLA / SpatialVLA
View on GitHub
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆710Jun 23, 2025Updated last year
microsoft / CogACT
View on GitHub
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆430Oct 30, 2025Updated 8 months ago
TianxingChen / G3Flow
View on GitHub
[CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation
☆96Jun 6, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Selen-Suyue / DensePolicy
View on GitHub
[ICCV 2025] Dense Policy (DSP): Bidirectional Autoregressive Learning of Actions
☆79Jan 14, 2026Updated 6 months ago
Robot-VLAs / RoboVLMs
View on GitHub
☆475Apr 14, 2026Updated 3 months ago
moojink / openvla-oft
View on GitHub
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,317Sep 9, 2025Updated 10 months ago
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated last year
InternRobotics / Seer
View on GitHub
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆310Jul 8, 2025Updated last year
mees / calvin
View on GitHub
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
☆964Sep 8, 2025Updated 10 months ago
zhihou7 / dit_policy_vla
View on GitHub
☆16Mar 26, 2025Updated last year
SudeepDasari / dit-policy
View on GitHub
☆164Oct 15, 2024Updated last year
TencentARC / Moto
View on GitHub
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆180Oct 1, 2025Updated 9 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
OpenMOSS / VLABench
View on GitHub
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆454Nov 11, 2025Updated 8 months ago
LatentActionPretraining / LAPA
View on GitHub
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆561Jan 22, 2025Updated last year
simpler-env / SimplerEnv
View on GitHub
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,129Dec 20, 2025Updated 7 months ago
roboterax / video-prediction-policy
View on GitHub
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆408May 17, 2025Updated last year
YanjieZe / 3D-Diffusion-Policy
View on GitHub
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
☆1,415Oct 17, 2025Updated 9 months ago
YanjieZe / Improved-3D-Diffusion-Policy
View on GitHub
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
☆550Jun 16, 2025Updated last year
juruobenruo / DexVLA
View on GitHub
☆63Apr 15, 2025Updated last year
InternRobotics / InternVLA-M1
View on GitHub
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆419Feb 11, 2026Updated 5 months ago
thu-ml / RoboticsDiffusionTransformer
View on GitHub
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
☆1,764Jan 21, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Lifelong-Robot-Learning / LIBERO
View on GitHub
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆2,106Mar 15, 2025Updated last year
cccedric / conrft
View on GitHub
This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".
☆361Mar 30, 2026Updated 3 months ago
ByteDance-Seed / Chain-of-Action
View on GitHub
Official implementation of Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation. Accepted in NeurIPS 2025.
☆107Dec 13, 2025Updated 7 months ago
Stanford-ILIAD / openvla-mini
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆374Mar 19, 2025Updated last year
WEIRDLabUW / unified-world-model
View on GitHub
Unfied World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets
☆246Oct 8, 2025Updated 9 months ago
Fanqi-Lin / OneTwoVLA
View on GitHub
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆236May 30, 2025Updated last year
intuitive-robots / flower_vla_calvin
View on GitHub
[CoRL 25] Code for FLOWER VLA for finetuning FLOWER on CALVIN and all LIBERO environments
☆94Sep 22, 2025Updated 10 months ago
allenzren / open-pi-zero
View on GitHub
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
☆1,507Jan 31, 2025Updated last year
PRIME-RL / SimpleVLA-RL
View on GitHub
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,798Jan 6, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cage-policy / CAGE
View on GitHub
[ICRA 2025] CAGE: Causal Attention Enables Data-Efficient Generalizable Robotic Manipulation
☆35Jan 14, 2025Updated last year
yuechen0614 / ET-SEED
View on GitHub
[ICLR 2025🎉] Official implementation for paper "ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy".
☆69Nov 3, 2025Updated 8 months ago
OpenHelix-Team / frappe
View on GitHub
Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
☆55Mar 24, 2026Updated 4 months ago
ucd-dare / VITA
View on GitHub
Flowing from Vision to Action: Noise-Free Flow Matching Policy Learning 🎉[ICLR 2026]
☆135May 14, 2026Updated 2 months ago
fuse-model / FuSe
View on GitHub
☆70Sep 18, 2025Updated 10 months ago
Max-Fu / otter
View on GitHub
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆118Apr 14, 2025Updated last year
A-embodied / A0
View on GitHub
☆77Updated this week