LatentActionPretraining / LAPALinks

[ICLR 2025] LAPA: Latent Action Pretraining from Videos

☆387

Alternatives and similar repositories for LAPA

Users that are interested in LAPA are comparing it to the libraries listed below

Sorting:

ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆278Updated 3 months ago
bytedance / GR-1
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
☆285Updated last year
embodiedreasoning / ERQA
Embodied Reasoning Question Answer (ERQA) Benchmark
☆232Updated 7 months ago
Robot-VLAs / RoboVLMs
☆403Updated 9 months ago
OpenMOSS / VLABench
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
☆310Updated 2 months ago
microsoft / CogACT
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
☆362Updated 5 months ago
MichalZawalski / embodied-CoT
Embodied Chain of Thought: A robotic policy that reason to solve the task.
☆312Updated 6 months ago
flow-diffusion / AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆232Updated last year
Stanford-ILIAD / openvla-mini
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆271Updated 7 months ago
InternRobotics / Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
☆247Updated 3 months ago
OpenHelix-Team / OpenHelix
OpenHelix: An Open-source Dual-System VLA Model for Robotic Manipulation
☆307Updated 2 months ago
baaivision / UniVLA
Unified Vision-Language-Action Model
☆213Updated last week
alibaba-damo-academy / WorldVLA
WorldVLA: Towards Autoregressive Action World Model
☆472Updated 2 weeks ago
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆144Updated 6 months ago
rail-berkeley / bridge_data_v2
☆226Updated last year
NVIDIA / GR00T-Dreams
Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆340Updated this week
RoboDita / Dita
ICCV2025
☆135Updated 2 months ago
allenai / molmoact
Official Repository for MolmoAct
☆224Updated last week
EDiRobotics / GR1-Training
Reimplementation of GR-1, a generalized policy for robotics manipulation.
☆143Updated last year
GuanxingLu / vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆313Updated last month
2toinf / UniAct
[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"
☆204Updated 7 months ago
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆541Updated 4 months ago
Large-Trajectory-Model / ATM
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
☆256Updated 4 months ago
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆191Updated last week
PKU-HMI-Lab / Hybrid-VLA
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
☆297Updated 3 weeks ago
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆137Updated 3 weeks ago
Fanqi-Lin / OneTwoVLA
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
☆191Updated 4 months ago
roboterax / video-prediction-policy
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆283Updated 5 months ago
LostXine / LLaRA
[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆225Updated 6 months ago
InternRobotics / VLAC
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆192Updated last month