microsoft/villa-x

☆206

Alternatives and similar repositories for villa-x

Users who are interested in villa-x are comparing it to the repositories listed below.

RoboDita / Dita
ICCV2025
☆171 · Dec 10, 2025 · Updated 7 months ago
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆179 · Oct 1, 2025 · Updated 9 months ago
InternRobotics / PPI
[RSS 2025] Gripper Keypose and Object Pointflow as Interfaces for Bimanual Robotic Manipulation
☆79 · Jul 22, 2025 · Updated 11 months ago
pairlab / QueST
Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]
☆114 · Nov 21, 2024 · Updated last year
cvlab-columbia / videopolicy
☆63 · Mar 3, 2026 · Updated 4 months ago
Zhangwenyao1 / DreamVLA
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆360 · Jan 6, 2026 · Updated 6 months ago
InternRobotics / Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆308 · Jul 8, 2025 · Updated last year
wudongming97 / AffordanceNet
[ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
☆49 · Nov 21, 2025 · Updated 7 months ago
DAVIAN-Robotics / ACG
Code for "ACG: Action Coherence Guidance for Flow-based Vision-Language-Action Models" (ICRA 2026)
☆82 · Mar 11, 2026 · Updated 4 months ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆397 · Jul 23, 2025 · Updated 11 months ago
Little-Podi / AdaWorld
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆250 · Jun 17, 2025 · Updated last year
Max-Fu / otter
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆118 · Apr 14, 2025 · Updated last year
UMass-Embodied-AGI / MindJourney
[NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"
☆151 · Nov 4, 2025 · Updated 8 months ago
OpenDriveLab / UniVLA
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
☆1,099 · Nov 19, 2025 · Updated 7 months ago
SJTU-DENG-Lab / Mantis
[CVPR 2026] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
☆92 · Jun 5, 2026 · Updated last month
schmidtdominik / LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
☆143 · Jul 31, 2024 · Updated last year
ChengYaofeng / franka_handeye_calibration_ros2
Handeye calibration for FR3 & Realsense with Ros2. Using Ros2 Humble, easy_handeye2, ros2_aruco.
☆22 · Jun 4, 2025 · Updated last year
villekuosmanen / rewACT
A supervised learning trained reward head for ACT
☆144 · Apr 21, 2026 · Updated 2 months ago
pointW / equidiff
[CoRL 2024 Outstanding Paper Award Finalist] Equivariant Diffusion Policy
☆135 · Feb 13, 2025 · Updated last year
facebookresearch / egoman
This repository provides code for the EgoMAN model and dataset creation scripts.
☆31 · Dec 31, 2025 · Updated 6 months ago
karthikscale3 / aws-workflow
AWS World implementation for Workflow DevKit - Run durable workflows on AWS Lambda with DynamoDB, SQS, and S3
☆41 · Oct 28, 2025 · Updated 8 months ago
SalesforceAIResearch / FOFPred
☆39 · Jun 2, 2026 · Updated last month
thu-ml / RDT2
Official code of RDT 2
☆791 · Feb 7, 2026 · Updated 5 months ago
simpler-env / SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆1,120 · Dec 20, 2025 · Updated 6 months ago
BeingBeyond / Being-H
Being-H is BeingBeyond's family of human-centric embodied foundation models.
☆1,093 · Jun 16, 2026 · Updated 3 weeks ago
TeleeMa / GLOVER
This is the official code repo for GLOVER and GLOVER++.
☆57 · Aug 6, 2025 · Updated 11 months ago
siddhanthaldar / Point-Policy
Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
☆90 · Jul 21, 2025 · Updated 11 months ago
JIAjindou / A2A_Flow_Matching
Accepted by RSS 2026
☆180 · Jun 1, 2026 · Updated last month
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆554 · Jan 22, 2025 · Updated last year
Universal-Control / ppt_learning
A unified robotic manipulation learning framework
☆23 · Sep 4, 2025 · Updated 10 months ago
Kami-code / HandsOnVLM-release
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆41 · Sep 15, 2025 · Updated 9 months ago
robopen / roboagent
Repository to train and evaluate RoboAgent
☆373 · Apr 2, 2024 · Updated 2 years ago
behavior-robot-suite / brs-algo
Official Algorithm Codebase for the Paper "BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household A…
☆170 · Aug 24, 2025 · Updated 10 months ago
exla-ai / fla
Library to finetune VLAs
☆61 · Feb 7, 2026 · Updated 5 months ago
leroy9472 / InMind
☆15 · Nov 18, 2025 · Updated 7 months ago
apple / ml-egodex
EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video
☆330 · Aug 20, 2025 · Updated 10 months ago
2toinf / DecisionNCE
[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
☆82 · May 26, 2025 · Updated last year
peterdavidfagan / mujoco_robot_environments
Prototyping mujoco simulation environments.
☆11 · Feb 20, 2025 · Updated last year
jiangranlv / ScissorBot
[CoRL 2024] ScissorBot: Learning Generalizable Scissor Skill for Paper Cutting via Simulation, Imitation, and Sim2Real
☆15 · Dec 25, 2024 · Updated last year