ZhuoyangLiu2005/MLA

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation

☆74

Alternatives and similar repositories for MLA

Users who are interested in MLA are comparing it to the repositories listed below.

GeWu-Lab / MS-Bot
The official repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
☆22 · Jun 25, 2025 · Updated last year
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆352 · Oct 3, 2025 · Updated 9 months ago
jiayueru / Video2Act
Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling
☆31 · Jun 24, 2026 · Updated 3 weeks ago
ZhuoyangLiu2005 / last0
[ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model
☆87 · Apr 30, 2026 · Updated 2 months ago
PKU-HMI-Lab / LIFT3D
[CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
☆186 · Jun 20, 2025 · Updated last year
RoboVerseOrg / ViTacFormer
ViTacFormer: Learning Cross-Modal Representation for Visuo-Tactile Dexterous Manipulation
☆111 · May 23, 2026 · Updated last month
XuWuLingYu / WristWorld
The official code for the paper WristWorld.
☆31 · Nov 8, 2025 · Updated 8 months ago
OpenGalaxea / GalaxeaVLA
Galaxea's open-source VLA repository
☆689 · Jul 11, 2026 · Updated last week
ZZongzheng0918 / TA-VLA
[CoRL 2025] TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models
☆112 · Oct 25, 2025 · Updated 8 months ago
univtac / UniVTAC
☆117 · Jun 20, 2026 · Updated last month
InternRobotics / InstructVLA
[ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
☆116 · Jan 27, 2026 · Updated 5 months ago
GeorgeWuzy / ViTacGen
Official Repository of ViTacGen: Robotic Pushing with Vision-to-Touch Generation (RA-L 2025 & ICRA 2026)
☆17 · Feb 5, 2026 · Updated 5 months ago
CHEN-H01 / Fast-in-Slow
Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning
☆159 · Aug 1, 2025 · Updated 11 months ago
OpenHelix-Team / Spatial-Forcing
[ICLR 2026] Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model
☆263 · Jul 7, 2026 · Updated 2 weeks ago
ZhuoyangLiu2005 / T-Rex
Official repository of T-Rex: Tactile-Reactive Dexterous Manipulation
☆191 · Updated this week
kingchou007 / adaptac-dex
[IROS 2025] Adaptive Visuo-Tactile Fusion with Predictive Force Attention for Dexterous Manipulation
☆24 · Apr 7, 2026 · Updated 3 months ago
gen-robot / RL4VLA
☆277 · Aug 25, 2025 · Updated 10 months ago
xiaoxiaoxh / reactive_diffusion_policy
[RSS 2025] Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation
☆359 · Apr 12, 2026 · Updated 3 months ago
PRIME-RL / SimpleVLA-RL
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,787 · Jan 6, 2026 · Updated 6 months ago
MiYanDoris / GraspVLA-playground
☆34 · Aug 21, 2025 · Updated 11 months ago
rise-policy / HistRISE
[ICRA 2026] History-Aware Visuomotor Policy Learning via Point Tracking
☆27 · Jan 10, 2026 · Updated 6 months ago
real-stanford / DexUMI
DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation
☆238 · Apr 18, 2026 · Updated 3 months ago
April-Yz / ManiGaussian_Bimanual
[IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model
☆45 · Jun 26, 2025 · Updated last year
jingjingqian75 / GeoPredict
[CVPR 2026] GeoPredict: Leveraging Predictive Kinematics and 3D Gaussian Geometry for Precise VLA Manipulation
☆27 · Jul 6, 2026 · Updated 2 weeks ago
InternRobotics / CronusVLA
[AAAI 2026 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
☆109 · Jan 11, 2026 · Updated 6 months ago
CHEN-H01 / LaST-R1
LaST-R1
☆104 · May 6, 2026 · Updated 2 months ago
BridgeVLA / BridgeVLA
✨✨【NeurIPS 2025】Official implementation of BridgeVLA
☆192 · Apr 5, 2026 · Updated 3 months ago
InternRobotics / F1-VLA
F1: A Vision Language Action Model Bridging Understanding and Generation to Actions
☆200 · Jan 2, 2026 · Updated 6 months ago
YixiangChen515 / EC-Flow
[ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow"
☆27 · Oct 16, 2025 · Updated 9 months ago
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆429 · Oct 30, 2025 · Updated 8 months ago
lzylucy / 4dgen
[ICLR 2026] Codebase for the paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆123 · Jan 10, 2026 · Updated 6 months ago
intuitive-robots / MoDE_Diffusion_Policy
[ICLR 2025] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"
☆125 · May 16, 2025 · Updated last year
michaelyuancb / ftp1-policy
FTP-1: A Generalist Foundation Tactile Policy Across Tactile Sensors for Contact-Rich Manipulation
☆89 · Jul 15, 2026 · Updated last week
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆418 · Feb 11, 2026 · Updated 5 months ago
RoyZry98 / MoLe-VLA-Pytorch
[AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Man…
☆70 · Jul 31, 2025 · Updated 11 months ago
MasterXiong / Hyper-VLA
Code for the paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"
☆26 · Oct 8, 2025 · Updated 9 months ago
Why-peace / F2F-AP
F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation
☆16 · Apr 7, 2026 · Updated 3 months ago
cocacola-lab / TLV-Link
An official implementation of Touch100k: A Large-Scale Touch-Language-Vision Dataset for Touch-Centric Multimodal Representation
☆34 · Jun 12, 2024 · Updated 2 years ago
nuomizai / T2VLM
[ICCV 2025] T2-VLM: Training-Free Generation of Temporally Consistent Rewards from VLMs
☆16 · Jul 8, 2025 · Updated last year