OpenHelix-Team/Spatial-Forcing

Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]

☆262

Alternatives and similar repositories for Spatial-Forcing

Users who are interested in Spatial-Forcing are comparing it to the repositories listed below.

OpenHelix-Team / frappe
Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
☆55 · Mar 24, 2026 · Updated 3 months ago
OpenHelix-Team / LLaVA-VLA
LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]
☆205 · Mar 12, 2026 · Updated 4 months ago
ZGC-EmbodyAI / LangForce
[ICML 2026] This repo is the official implementation of "LangForce : Bayesian Decomposition of Vision Language Action Models via Latent …
☆72 · Jun 16, 2026 · Updated last month
OpenHelix-Team / ReconVLA
Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
☆270 · Apr 1, 2026 · Updated 3 months ago
OpenHelix-Team / Unified-Diffusion-VLA
🔥 The first open-sourced diffusion vision-language-action model. [ICLR 2026]
☆185 · Mar 12, 2026 · Updated 4 months ago
moojink / openvla-oft
Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
☆1,299 · Sep 9, 2025 · Updated 10 months ago
OpenHelix-Team / HiF-VLA
[CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model
☆74 · Mar 11, 2026 · Updated 4 months ago
starVLA / starVLA
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
☆3,237 · Updated this week
PKU-EPIC / GraspVLA
[CoRL25] GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data
☆390 · Dec 29, 2025 · Updated 6 months ago
Robbyant / lingbot-va
[RSS 2026] Causal video-action world model for generalist robot control
☆1,632 · Jul 9, 2026 · Updated last week
yuantianyuan01 / FastWAM
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,185 · Apr 3, 2026 · Updated 3 months ago
Zhangwenyao1 / DreamVLA
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆362 · Jan 6, 2026 · Updated 6 months ago
Selen-Suyue / WoG
[ICML 2026] 🏂 World Guidance: World Modeling in Condition Space for Action Generation
☆157 · Apr 28, 2026 · Updated 2 months ago
OpenHelix-Team / VLA-Adapter
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
☆2,243 · Mar 19, 2026 · Updated 4 months ago
OpenHelix-Team / VLA-RFT
VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning
☆162 · Oct 6, 2025 · Updated 9 months ago
PRIME-RL / SimpleVLA-RL
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,786 · Jan 6, 2026 · Updated 6 months ago
X-Square-Robot / wall-x
Building General-Purpose Robots Based on Embodied Foundation Model
☆1,177 · Jul 7, 2026 · Updated 2 weeks ago
SpatialVLA / SpatialVLA
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
☆706 · Jun 23, 2025 · Updated last year
InternRobotics / InternVLA-A-series
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
☆507 · Updated this week
Lifelong-Robot-Learning / LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
☆2,081 · Mar 15, 2025 · Updated last year
NVlabs / cosmos-policy
Cosmos Policy
☆835 · Jan 23, 2026 · Updated 5 months ago
thu-ml / Motus
Official code of Motus: A Unified Latent Action World Model
☆1,209 · Jan 5, 2026 · Updated 6 months ago
LoveJu1y / LaRA-VLA
[ICML 2026] Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models
☆77 · May 18, 2026 · Updated 2 months ago
2toinf / X-VLA
[ICLR 2026] The official implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
☆692 · Jun 10, 2026 · Updated last month
shihao1895 / MemoryVLA
[ICLR 2026] Code of "MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation"
☆304 · Jun 13, 2026 · Updated last month
FALCON-VLA / FALCON
[ICLR 2026] 🦅 FALCON: an effective vision-language-action model injects rich 3D spatial tokens into the action head, enabling robust spa…
☆31 · May 26, 2026 · Updated last month
alibaba-damo-academy / RynnVLA-002
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,098 · Dec 2, 2025 · Updated 7 months ago
sylvestf / LIBERO-plus
Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.
☆382 · Jan 21, 2026 · Updated 6 months ago
OpenHelix-Team / CEED-VLA
[ECCV 2026] Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
☆51 · Sep 15, 2025 · Updated 10 months ago
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆560 · Jan 22, 2025 · Updated last year
Zxy-MLlab / LIBERO-PRO
Official repository of LIBERO-PRO, an evaluation extension of the original LIBERO benchmark
☆283 · Updated this week
roboterax / video-prediction-policy
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io
☆403 · May 17, 2025 · Updated last year
dreamzero0 / dreamzero
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,466 · Apr 19, 2026 · Updated 3 months ago
InternRobotics / InternVLA-M1
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
☆418 · Feb 11, 2026 · Updated 5 months ago
RoboTwin-Platform / RMBench
Memory-Dependent Manipulation Benchmark based on RoboTwin
☆172 · Jul 14, 2026 · Updated last week
LogosRoboticsGroup / DeFi
[ICLR 2026] Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining
☆30 · Apr 26, 2026 · Updated 2 months ago
BridgeVLA / BridgeVLA
✨✨【NeurIPS 2025】Official implementation of BridgeVLA
☆192 · Apr 5, 2026 · Updated 3 months ago
MINT-SJTU / Evo-0
Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.
☆54 · Nov 24, 2025 · Updated 7 months ago
OpenHelix-Team / CapVector
☆49 · May 12, 2026 · Updated 2 months ago