CladernyJorn/UP-VLA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CladernyJorn/UP-VLA)

CladernyJorn / UP-VLA

Official PyTorch implementation for ICML 2025 paper: UP-VLA.

☆61

Alternatives and similar repositories for UP-VLA

Users that are interested in UP-VLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenHelix-Team / frappe
View on GitHub
Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
☆55Mar 24, 2026Updated 4 months ago
Lizhuoling / VIRT
View on GitHub
☆33May 16, 2025Updated last year
liyi14 / HAMSTER_beta
View on GitHub
☆62Apr 18, 2025Updated last year
BridgeVLA / BridgeVLA
View on GitHub
✨✨【NeurIPS 2025】Official implementation of BridgeVLA
☆193Apr 5, 2026Updated 3 months ago
hhcaz / e2vla
View on GitHub
☆25Oct 18, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
xiaoxiao0406 / VQ-VLA
View on GitHub
The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆133Nov 15, 2025Updated 8 months ago
lzylucy / 4dgen
View on GitHub
[ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆123Jan 10, 2026Updated 6 months ago
roboterax / video-prediction-policy
View on GitHub
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆408May 17, 2025Updated last year
vlc-robot / hiveformer
View on GitHub
☆33Sep 25, 2024Updated last year
AgibotTech / EWMBench
View on GitHub
Official code for EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models
☆129Jun 13, 2025Updated last year
InternRobotics / InternVLA-M1
View on GitHub
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆419Feb 11, 2026Updated 5 months ago
Robot-VLAs / RoboVLMs
View on GitHub
☆475Apr 14, 2026Updated 3 months ago
yufeiwang63 / ArticuBot
View on GitHub
Official repository for RSS 25 paper: ArticuBot. Project: https://articubot.github.io/
☆41Mar 19, 2026Updated 4 months ago
baaivision / UniVLA
View on GitHub
[ICLR 2026] Unified Vision-Language-Action Model
☆315Oct 15, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
alibaba-damo-academy / RynnVLA-002
View on GitHub
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,103Dec 2, 2025Updated 7 months ago
ShuangLI59 / unified_video_action
View on GitHub
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆400Jul 23, 2025Updated last year
ai4ce / INT-ACT
View on GitHub
Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
☆33Nov 2, 2025Updated 8 months ago
sihengz02 / RoLA
View on GitHub
[CoRL 2025] Robot Learning from Any Images
☆34Nov 11, 2025Updated 8 months ago
pickxiguapi / Embodied-R1
View on GitHub
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" (ICLR2026)
☆152Mar 3, 2026Updated 4 months ago
Selen-Suyue / DensePolicy
View on GitHub
[ICCV 2025] Dense Policy (DSP): Bidirectional Autoregressive Learning of Actions
☆79Jan 14, 2026Updated 6 months ago
StoreBlank / KUDA
View on GitHub
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation
☆22Apr 23, 2025Updated last year
LogosRoboticsGroup / 4D-VLA
View on GitHub
4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.
☆58Jan 10, 2026Updated 6 months ago
allenai / SimplerEnv
View on GitHub
☆20Sep 2, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
expectorlin / ADAPT
View on GitHub
code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)
☆10Jul 17, 2022Updated 4 years ago
Zhangwenyao1 / DreamVLA
View on GitHub
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆364Jan 6, 2026Updated 6 months ago
RoboDita / Dita
View on GitHub
ICCV2025
☆171Dec 10, 2025Updated 7 months ago
CladernyJorn / VLM4VLA
View on GitHub
Implementation of VLM4VLA
☆165Apr 22, 2026Updated 3 months ago
OpenHelix-Team / VLA-2
View on GitHub
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation
☆32Nov 3, 2025Updated 8 months ago
PKU-HMI-Lab / AC-DiT
View on GitHub
AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation
☆48Feb 23, 2026Updated 5 months ago
Biscue5 / EgoScaler
View on GitHub
[CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
☆48Dec 2, 2025Updated 7 months ago
UMass-Embodied-AGI / TesserAct
View on GitHub
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆403Aug 4, 2025Updated 11 months ago
OpenHelix-Team / Spatial-Forcing
View on GitHub
Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]
☆270Jul 7, 2026Updated 3 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LogosRoboticsGroup / DeFi
View on GitHub
[ICLR 2026] Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining
☆31Apr 26, 2026Updated 3 months ago
Robert-gyj / Prediction_with_Action
View on GitHub
Official PyTorch implementation for NeurIPS 2024 paper: Prediction with Action.
☆55Jan 4, 2025Updated last year
yunhaif / reflect-vlm
View on GitHub
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation
☆178Jul 17, 2025Updated last year
MichalZawalski / embodied-CoT
View on GitHub
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆411Apr 5, 2025Updated last year
hany01rye / tiger
View on GitHub
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
☆23Nov 18, 2025Updated 8 months ago
Chaoqi-LIU / oat
View on GitHub
[RSS 2026] Ordered Action Tokenization
☆103Updated this week
Fanqi-Lin / OneTwoVLA
View on GitHub
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆236May 30, 2025Updated last year