RoyZry98/MoLe-VLA-Pytorch


RoyZry98 / MoLe-VLA-Pytorch

[AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation

☆70

Alternatives and similar repositories for MoLe-VLA-Pytorch

Users who are interested in MoLe-VLA-Pytorch are comparing it to the repositories listed below.

RoyZry98 / MoANT-Pytorch
[CoLM 2026] Official code for MoANT: Mixture-of-Rank-One-Experts with semantic-aware Intuition for Multi-task Large Language Model Finetu…
☆17 · May 16, 2025 · Updated last year
RoyZry98 / MoASE-Pytorch
🔥 [AAAI 2026 Oral] Official code for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptat…
☆79 · May 13, 2026 · Updated 2 months ago
PKU-HMI-Lab / LIFT3D
[CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
☆186 · Jun 20, 2025 · Updated last year
RoyZry98 / VeCAF-Pytorch
[MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness
☆52 · Jul 24, 2024 · Updated last year
siyuhsu / vla-cache
[NeurIPS 2025] VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching
☆91 · Feb 27, 2026 · Updated 4 months ago
RoyZry98 / RepCaM-Pytorch
[TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery
☆54 · Apr 21, 2025 · Updated last year
LukeLIN-web / vote
Vision-Language-Action Optimization with Trajectory Ensemble Voting (ICANN2026)
☆26 · Feb 18, 2026 · Updated 5 months ago
gooogleshanghai / ActDistill
Action-Guided Knowledge Distillation for VLA Models
☆19 · Dec 16, 2025 · Updated 7 months ago
OpenHelix-Team / CEED-VLA
[ECCV 2026] Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
☆51 · Sep 15, 2025 · Updated 10 months ago
RoyZry98 / MoFME-Pytorch
[AAAI 2024] Official code for Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation
☆65 · Mar 16, 2026 · Updated 4 months ago
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆352 · Oct 3, 2025 · Updated 9 months ago
litwellchi / M2Chat
☆36 · Feb 6, 2025 · Updated last year
litwellchi / MMTrail
[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
☆35 · Feb 6, 2025 · Updated last year
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆429 · Oct 30, 2025 · Updated 8 months ago
PineTreeWss / SpecVLA
Implementation of the paper 'Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance' (EMNLP 2025)
☆34 · Dec 16, 2025 · Updated 7 months ago
ustcwhy / BitVLA
Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
☆161 · Mar 2, 2026 · Updated 4 months ago
yulin-luo / RoboBench
This is the official evaluation code for Robobench
☆22 · Updated this week
wow-world-model / wow-world-model
WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…
☆164 · Jan 4, 2026 · Updated 6 months ago
RoboDita / Dita
ICCV2025
☆171 · Dec 10, 2025 · Updated 7 months ago
kriskrisliu / PAT
[AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models
☆37 · Feb 1, 2025 · Updated last year
moojink / openvla-oft
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,299 · Sep 9, 2025 · Updated 10 months ago
Zhangwenyao1 / DreamVLA
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆362 · Jan 6, 2026 · Updated 6 months ago
Gumpest / SparseVLMs
[ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference".
☆266 · Dec 22, 2025 · Updated 6 months ago
ai4ce / INT-ACT
Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
☆33 · Nov 2, 2025 · Updated 8 months ago
OpenHelix-Team / HiF-VLA
[CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model
☆74 · Mar 11, 2026 · Updated 4 months ago
CHEN-H01 / Fast-in-Slow
Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning
☆158 · Aug 1, 2025 · Updated 11 months ago
URDF-Anything-plus / Code
☆44 · Mar 17, 2026 · Updated 4 months ago
moka-manipulation / moka
MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)
☆101 · Jul 16, 2024 · Updated 2 years ago
KolaKivy / AFRO
☆23 · Jan 26, 2026 · Updated 5 months ago
baaivision / UniVLA
[ICLR 2026] Unified Vision-Language-Action Model
☆314 · Oct 15, 2025 · Updated 9 months ago
MINT-SJTU / Evo-0
Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.
☆54 · Nov 24, 2025 · Updated 7 months ago
OpenHelix-Team / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆387 · Aug 27, 2025 · Updated 10 months ago
hhcaz / e2vla
☆25 · Oct 18, 2025 · Updated 9 months ago
Gumpest / FreeKD
[CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".
☆50 · Apr 20, 2024 · Updated 2 years ago
Cognition2Action-Lab / VLA-TMEE
Reshaping Action Error Distributions for Reliable Vision-Language-Action Models
☆17 · Feb 5, 2026 · Updated 5 months ago
OpenHelix-Team / frappe
Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
☆55 · Mar 24, 2026 · Updated 3 months ago
Tencent / VITA
The official implementation of VITA, VITA15, LongVITA, VITA-Audio, VITA-VLA, and VITA-E.
☆162 · Oct 28, 2025 · Updated 8 months ago
ZhuoyangLiu2005 / MLA
MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
☆74 · Nov 10, 2025 · Updated 8 months ago
foundation-multimodal-models / ConBench
[NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".
☆39 · Oct 23, 2024 · Updated last year